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Sequences producing significant alignments: 
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aiL 



gi 
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ai 



9^ 



g^ 



i.9A!^!?M]:i:!^^'JLl^£-.y5^ chondroitin betal,4 N-acetylga. 

i^ggggJ jpj I BAC16217 . 1 1 beta- 1 , 4 -N-acetylgalactosaminyl . 

2634S685 | dbj | BAC379B2 . 1 | unnamed protein product [Mus mu . 

31559877 I ret I NP 766341,2 | RIKEN cDNA 4732435N03; betal,4. 



?d^A?:l§3Al.^9j}^^^?2:§,9^^^^^ unnamed protein product [Mus mu. 

96J5 K)83 [db j '[ 3^ b63 5 5 4"T[ hypothetical protein [Macaca fas. 



34S77724 



l9910 



24429592 



31560032 



26366533 



47227595 



38174240 



26340498 



re 



dbj 



etno 



XP 224757.2 



7023813 I dbj |BAA92093 .: 



20072996 



3485S369 
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16537 



n.l98419 



34365000 



24652379 



11360289 



18677088 
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gi 4 1529141 
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ai 
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27681143 



28411190 



similar to chondroitin betal,4. 

.^Z.?Zi?.9.™iL.. unnamed protein product [Tetrao, 
NF 061060.3 I chondroitin betal,4 N-acetylga. 
NP 084441.3 I chondroitin sulfate GalNAcT-2 . 
F^dF^?J:2§l.:.-^^L... unnamed protein product [Mus mu . 
,£!^?l9.2,^1?A:J:L„. unnamed protein product [Tetrao. 
gbjAAH60772 . 1 | ChGn protein [Homo sapiens] 
dbj I B7\C33912 . 1 1 unnamed protein product [Mus mu. 

unnamed protein product [Homo sa. 
SpAd^^A^.!^l^A:2l.... 4732435N03Rik protein [Mus muscu . 
XP__232316 . 2 I similar to chondroitin betal,4. 

iA-. i.L . unnamed protein product [Tetrao, 
X P 'l'Q'srsTV'gJ^ ENSANGP00000021559 [Anopheles . 
CAE46d36 . 1 [ hypothetical protein [Homo sapi . 
l^P 610567.1 1 CG12913-PA [Drosophila melanog. 
|T46 919 hypothetical protein DKFZp762L037 . 1 . , 
BA.B85092 . 1 1 unnamed protein product [Homo s. 
?:?^M5.?A?i.:J:.L.. hypothetical protein [Homo sapi . 
gb I AA1-I60I69 . 1 [ Galnact2 protein [Mus musculus] 
emb|CAF879Q6.lf unnamed protein product [Tetrao. 
dbj |a^C98S32.l| chondroitin sulfate synthase [H. 
I:^±l^S^2I^^^^£l..,. chondroitin sulfate synthase 3. 
emb| CAE173 26 . 1 | chodroitin synthase 2 [Homo sap. 
ref|XP 225912.1 1 similar to carbohydrate (chond. 
emb I CAD43233 . 1 | chondroitin synthase 2 [Homo sa. 



dbj 



38086850 



2.?i^.?21llgjlL:^:^^^^ PRO0082 [Homo sapiens] 

rer! | XP 1 943 58 . 2 1 similar to mKIAA0990 protein [. 
dbj |B7\C9S'065. 1| mKIAA0990 protein [Mus musculus] 
gb|AAI-I23112 . 1 1 Galnact2 protein [Mus musculus] 



37360174 



18606381 



47086685 



47213615 



40789012 



L 7453 



31542309 



37182181 



34S57503 
41210270 



48 13 930 J 



42662609 



31158517 



45555191 



45555180 



L5S515 



1750923 9 



33300399 



ret 



emt 



dbj 



dbj 



rer 



NP 997843. 



C^F9S9S6riT 



BAA76 834 .2 I 



RAB64 936_. l | _ 



NP 055733.2 



gb|AAQ8S893 . : 



ref 1 XP_2 1 B 7 5 9 . 2 I 
r£i:_jXP__2i0054 .3j 



carbohydrate (chondroitin) syn. 
unnamed protein product [Tetrao. 
KIAA0990 protein [Homo sapiens] 
chondroitin synthase [Homo sapi . 
carbohydrate (chondroitin) syn. 
CHSYl [Homo sapiens] 

similar to carbohydrate (chond. 
similar to dJ341D10.1 (novel p. 
similar to chondroitin synthas . 
ref |XP 378123.1 1 similar to dJ19Nl.l (novel pro. 
gb I AA08 52 7 5 . l"| ^chondroitin synthase-like protei . 
ref I NP 996440.1 1 CG9220-PC [Drosophila melanoga. 
rel: |NP 9 96439 .l| CG9220-PB [Drosophila melanoga. 

chondroitin disaccharide polymer. 

chondroitin synthase [Caenorhab. 
chondroitin disaccharide polymer, 
chondroitin disaccharide polym. 
C. elegans SQV-5 protein (corre. 



ref |XP 396991.1 1 



Jl 15 S?lT'"gb [>50S 52 73\'l^ 
31044^74 Idib^^ 



gb[AA0B5274 . 
rei: 
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NP 4 92630.1 
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39595217 



31200513 



3486'; 



3S566700 



21757529 



38566692 



47077041 



34861210 



24416540 



40789255 



46275756 



re 



dbj 



ret 



dbj 



re i: 



CAE60254.1| 



XP 309203. 



XP 3428'i8 . 1 



NP 942585.1 1 



RA.C05141.ll 



NP 775864.2 I 



R^D18454.1 



X.P _ 219 477^2 I 



gb 1 7\7\H3 888 1 . 1 



1655370S ^ 
9229926jdi 



47210467 
47217479 



3718193 



47215238 



27545323 



12053139 



10439162 



J 9644 83 7 



23594326 



48525359 



27685255 



47224136 



28972740 
34353 973 



18043971 



1755437S 



32565140 



395S5100 
19922240 



34857950 



28958144 



Hypothetical protein CBG03828 [. 
ENSANGPOO 0000053 93 [Anopheles , 
similar to chondroitin betal,4. 
beta 1, 4-N-acetylgalactosaminy . 
unnamed protein product [Homo s . 

beta 1 , 4 -N-acetylgalactosaminy , 
unnamed protein product [Homo s. 
similar to hypothetical protei . 
BC038881 protein [Mus musculus] 
ref |np 848532.2 I betal , 4 -N-acetylgalactosaminyl . 
dbj |bad15101.i| betal , 4 -N-acetylgalactosaminyl t . 

unnamed protein product [Homo s. 
Not6 [Ciona intestinalis] 
unnamed protein product [Tetrao. 
unnamed protein product [Tetrao. 
RASL651 [Homo sapiens] 
unnamed protein product [Tetrao. 

chondroitin polymerizing facto, 
hypothetical protein [Homo sapi . 
!^pjA]^^h^^l!?A.^Jt:S^ [ unnamed protein product [Homo s. 
gb|A7\H0 8878.2 I CHPF protein [Homo sapiens] 

DNA segment, Chr 1, Brigham & . 
chondroitin polymerizing fa. 



dbj |RAB71566 . 
"3 BABO 0632 



erno 
emb 



CAF9423 0.1 
'CAGT02487i 



gb|7\AQS8769.l[ 



ref ?JP 078812.2 



emb|C^B66748.l| 



re 



ret: 



ret: 



emjo 



dbj 

re !: 



XP 129885.2 1 



NP 001001565. 



XP 237306. 



CAG13056.1 



similar to chondroitin polymer, 
unnamed protein product [Tetrao. 

?!^?i^57e.L^^^^^ mKIAA14 02 protein [Mus musculus] 
XP 216063 ,2]' similar to mKI7UV1402 protein [. 
gb|7A7\:ni9714.l| 2010209O12Rik protein [Mus muscu. 

y putative protein, with a coile. 

putative cytoplasmic protein, . 



re 



NP 498934 . ; 



NP 371671.1 



Hypothetical protein CBG06915 [. 
NP„i-^lQ946.1 j CG8536-PA [Drosophila melanoga. 
XF 227406.2 I similar to Lbcll protein [Ratt . 
gb I AAI-I47275 . 1 1 "Similar to RIKEN cDNA 2010209012. 
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J 87 9244 



go 



29421290 



7\7\H06589.1 



AAO59307.1 



15792460 



47227208 



emo 



NP 282283.1 



5305592 |gb |aaD41721 . 1 | N-acetylglucosamine galactosyltra . , 

mKIAA0651 protein [Mus musculus] 
:£ll!LL^I?_.^il?A]::L:J^^^ rho/rac guanine nucleotide exc, 
Rho/rac guanine nucleotide excha. 
Icinesin [Gibberella moniliformis] 

putative two-domain glycosyltr. 

ClAG00570.l| unnamed protein product [Tetrao. 

CSGlcA-T protein [Homo sapiens] 
KIAA1402 protein [Homo sapiens, 
chondroitin sulfate glucuronyl . 
KIAA1402 protein [Homo sapiens] 
beta- 1, 4 -galactosyltransf erase . 
NAD (P)H- flavin reductase [Shew. 



40353014 



42657991 
48717495 
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ref XP 3 76724. 



rer: NP 061888.1 



7243185 I dbj | B7\A9264Q . 1 1 
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NP 716140. 
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^ 1 1 NP 060S41 . 3 I chondroitin betal, 4 N-acetylgalactosaminyl trans f 

sapiens] 

alij:555^ii:^.Ld);:^^^^^^ chondroitin betal, 4 N-acetylgalactosaminyltransf erai 

sapiens] 

gi j 37182006 j gb j AAQ88806 . 1 j MMVR656 [Homo sapiens] 
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Length =532 
Score = 1075 bits (2780), Expect =0.0 

Identities = 521/521 (100%), Positives = 521/521 (100%) 

Query: 1 MIW/RRGLIAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGK^ 60 

MMIWRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAV 
Sbjct: 1 MMIWRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQ 60 

Query: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 120 

LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 
Sbjct: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 120 

Query: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 
Sbjct: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

Query: 241 LFRPFSPIMKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVY 300 

LFRPFS P I MKVKNEKLNl^W^TLI NVI VPLAPO^VDKFRQFMQNFREMCI EQDGRVHLTVVY 
Sbjct: 241 LFRPFSPIMKVKNEKLNMANTLIWIVPLAKRVDKFRQFMQNFREMCIEQDGRW 300 

Query: 3 01 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 
Sbjct: 301 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

Query: 3 61 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI lYGHHDAVPPLEQQLVIKKETGFWRD 420 

I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KKETGFWRD 
Sbjct: 361 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI lYGHHDAVPPLEQQLVIKKETGFWRD 420 

Query: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 
Sbjct: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

Query: 481 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 

CMDELT PEQYKMCMQSKAMNEASHGQLGMLVFRHE I EAHLR 
Sbjct: 481 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 

^ >gi I 22852208 | dbj | H?^.C16217 . 1 1 beta- 1 , 4 -N-acetylgalactosaminyltransf erase [Homo 

Length =532 

Score = 1074 bits (2777), Expect =0.0 

Identities = 520/521 (99%), Positives = 520/521 (99%) 

Query: 1 IWIVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAV 60 

MMMVRRGLLAW I SRVWLLVLLCCAI S VLYMLACTPKGDEEQLALPRANS PTGKEGYQAV 
Sbjct: 1 ^MWRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAV 60 

Query: 61 LQEWEEQHRMYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 120 

LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 
Sbjct: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 120 

Query: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

VEAI ESALETLNNPAENS PNHRPYTASDF I EG I YRTERDKGTLYELTFKGDHKHEFKRL I 
Sbjct: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

Query: 241 LFRPFSPIMKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVY 300 
LFRPF PIMKVKNEKLNMANTLIWIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVY 
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Sbjct: 241 LFRPFGPIMICVKNEKLNMANTLIWIVPLAKRVDKFRQFMQNFREMCIEQIX3RW 300 

Query: 301 FGKEEIWEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 
Sbjct: 301 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

Query: 361 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRD 420 

I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KKETGFWRD 
Sbjct: 361 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRD 420 

Query: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 
Sbjct: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

Query: 481 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 

CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 
Sbjct: 481 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 



LJ >gi I 2634 3685 I dbj | BA.C37982 . ij unnamed protein product [Mus musculus] 



Score = 1002 bits (2590), Expect =0.0 

Identities = 477/521 (91%), Positives = 496/521 (95%), Gaps = 4/521 (0%) 

Query: 3 MVRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ 62 

MVRRGLL WISRW+LLVLLCCAISVLYMLACTPKGD+EQL LPRAN PTGK+GYQAVLQ 
Sbjct: 1 MVRRGLLGWISRWILLVLLCCAISVLYMLACTPKGDQEQLGLPRANGPTGKDGYQAVLQ 60 

Query: 63 EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGL--GLDRSPPEICrQADL 120 

E EEQHRNYV+SLKRQIAQLK+ELQ RSEQ R+GQ QASDA L G DR P K QADL 
Sbjct: 61 EREEQHRNYVNSLKRQIAQLKDELQARSEQFRSGQDQASDATSLRSGWDREP--KAQADL 118 

Query: 121 LAFLHSQVDKAEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

LAFL QVDKAEV+AGVKliATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 119 LAFLRGQVDKAEVHAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 178 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 24 0 

VEAIESALE+LN+P E+SP+ RPYTA+DFIEGIYRTERDKGTLYELTFKGDHKHEF+RL+ 
Sbjct: 179 VEAIESALESLNSPVESSPHQRPYTAADFIEGIYRTERDKGTLYELTFKGDHKHEFQRLV 238 

Query: 241 LFRPFSPIMKVKNEKLNMANTLIWIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTV^ 300 

LFRPF PIMKVK EKliN+ANTLINVIVPLA+RVDKFR FMQNFREMCI +QDGRVHLTVVY 
Sbjct: 239 LFRPFGPIMKVKKEKLNLAlSrrLINVIVPLARRVDKFRHFMQNFREMCIQQDGRVHLTVVY 298 

Query: 301 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 3 60 

FGKEE+NEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 
Sbjct: 299 FGKEEMNEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 358 

Query: 361 I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGII YGHHDAVPPLEQQLVI KKETGFWRD 420 

I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPG+I YGHHDAVPPL QQLVI KKETGFWRD 
Sbjct: 359 I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGV I YGHHDAVPPLGQQLVI KKETGFWRD 418 

Query: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEK 
Sbjct: 419 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKH 478 

Query: 481 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 

CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 
Sbjct: 479 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 519 

□ >gi I 31559877 | ref | NP 766341 . 2 | RIKEN cDNA 4732435N03; betal,4 N-acetylgalactosa 



Length =530 



[Mus musculus] 
gi j 26329203 I d.bj j RAC283 40 . 1 1 



unnamed protein product [Mus musculus] 



Length =530 
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RID=1088011356-16594-155894407553.BLASTQ4, 



http;//ww.iK;bi.nlmnih.gov/BLAST/Blast.cgi 



Score = 1001 bits (2588), Expect = 0.0 

Identities = 476/521 (91%), Positives = 496/521 (95%), Gaps = 4/521 (0%) 

Query: 3 MVRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ 62 

MVRRGLL WISRW+LLVLLCCAISVLYMLACTPKGD+EQL LPRAN PTGK+GYQAVLQ 
Sbjct: 1 MVRRGLLGWISRWILLVLLCCAISVLYMLACTPKGDQEQLGLPRANGPTGKDGYQAVLQ 60 

Query: 63 EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGL- -GLDRSPPEKTQADL 120 

E EEQHRNYV+SLKRQIAQLK+ELQ RSEQ R+GQ QASDA L G DR P K QADL 
Sbjct: 61 EREEQHRNYVNSLKRQIAQLKDELQARSEQFRSGQDQASDATSLRSGWDREP--KAQADL 118 

Query: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTraPEEKPVRKDKRDEL 180 

LAFL QVDKAEV+AGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 119 LAFLRGQVDKAEVHAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 178 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 24 0 

VEAIESALE++N+P E+SP+ RPYTA+DFIEGIYRTERDKGTLYELTFKGDHKHEF+RL+ 
Sbjct: 179 VEAIESALESVNSPVESSPHQRPYTAADFIEGIYRTERDKGTLYELTFKGDHKHEFQRLV 238 

Query: 241 LFRPFSPIMKVKNEKLNMANTLIWIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTVW 300 

LFRPF PIMKVK EKLN+ANTLINVIVPLA+RVDKFR FMQNFREMCI +QDGRVHLTVVY 
Sbjct: 23 9 LFRPFGPIMKVKKEKLNLANTLIIWIVPLARRVDKFRHFMQNFREMCIQQDGRVHLTVVY 298 

Query: 301 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

FGKEE+NEVKG I LENTS KAANFRNFTF I QLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 
Sbjct: 299 FGKEEMNEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 358 

Query: 361 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRD 420 

lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPG+IYGHHDAVPPL QQLVIKKETGFWRD 
Sbjct: 3 59 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGVIYGHHDAVPPLGQQLVIKKETGFWRD 418 

Query: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEK 
Sbjct: 419 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKH 478 

Query: 481 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEMLR 521 

CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 
Sbjct: 479 CMDELTPEQYJCMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 519 



U >gi I 26324624 | dbj | BAC26066 . 1 1 unnamed protein product [Mus musculus] 

Length =530 

Score = 1001 bits (2588), Expect = 0.0 

Identities = 4^6/521 (91%), Positives = 496/521 (95%), Gaps = 4/521 (0%) 

Query: 3 MVRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ 62 

MVRRGLL WISRW+LLVLLCCAISVLYMLACTPKGD+EQL LPRAN PTGK+GYQAVLQ 
Sbjct: 1 MVRRGLLGWISRWILLVLLCCAISVLYMLACTPKGDQEQLGLPRANGPTGKDGYQAVLQ 60 

Query: 63 EWEEQHRNYVS SLKRQ I AQLKEELQERSEQLRNGQYQASDAAGL- -GLDRSPPEKTQADL 120 

E EEQHRNYV+SLKRQIAQLK+ELQ RSEQ R+GQ QASDA L G DR P K QADL 
Sbjct: 61 EREEQHRNYVNSLKRQIAQLKDELQARSEQFRSGQDQASDATSLRSGWDREP--KAQADL 118 

Query: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

LAFL QVDKAEV+AGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 119 LAFLRGQVDKAEVHAGVICLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 178 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

VEAIESA+E+LN+P E+SP+ RPYTA+DFIEGIYRTERDKGTLYELTFKGDHKHEF+RL+ 
Sbjct: 179 VEAIESAMESLNSPVESSPHQRPYTAADFIEGIYRTERDKGTLYELTFKGDHKHEFQRLV 238 

Query: 241 LFRPFSPIMKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWY 300 

LFRPF PIMKVK EKLN+ANTLINVIVPLA+RVDKFR FMQNFREMCI+QDGRVHLTWY 
Sbjct: 239 LFRPFGPIMKVKKEKLNLANTLINVIVPLARRVDKFRHFMQNFREMCIQQDGRVHLTWY 298 
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RE)=1088011356-16594-155894407553.BLASTQ4, 



http://www.iicbi.nliiiiiih.gov/BLAST/Blast.cgi 



Query: 301 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

FGKEE+NEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 
Sbjct: 299 FGKEEMNEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 358 

Query: 361 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI lYGHHDAVPPLEQQLVIKKETGFWRD 420 

I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPG+I YGHHDAVPPL QQLVI KKETGFWRD 
Sbjct: 359 I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGVIYGHHDAVPPLGQQLVI KKETGFWRD 418 

Query: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEK 
Sbjct: 419 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKH 478 

Query: 481 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 

CMDELT PEQYKMCMQS KAMNEASHGQLGMLVFRHE I EAHLR 
Sbjct: 479 CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 519 

□ >gij 9651033 | dbj 1 3A303554 . 1 1 hypothetical protein [Macaca fascicularis] 
Length =~T^^^^ 

Score = 915 bits (2364), Expect =0.0 

Identities = 443/453 (97%), Positives = 448/453 (98%) 

Query: 1 MMMVRRGLIAWISRVWLLVLLCCAISVLYMIACTPKGDEEQIALPRAN 60 

MMIWRRGLIAWISRVVVLLVLLCCAISVLYMLACTPKGD+EQLALPRANSPTGKEGY 
Sbjct: 1 MMMVRRGLLAWISRVWLLVLLCCAISVLYMIACTPKGDDEQLALPRANSPTGKEGYQAI 60 

Query: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 120 

LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLG DRSPPEKTQADL 
Sbjct: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGPDRSPPEKTQADL 120 

Query: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

LAFLHSQVDKAEV+AGVKL TEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 121 LAFLHSQVDKAEVHAGVKLTTEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

VEA I ESALETLNNPAENS PNHRPYTASDF I EG I YRTERDKGTLYELTFKGDHKHEFKRL+ 
Sbjct: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLV 240 

Query: 241 LFRPFSPIMKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVY 300 

LFRPF PI+KVK EKLNMANTLINVIVPLAKRVDKF QFMQNFREMCI EQDGRVHLTWY 
Sbjct: 241 LFRPFGPIIKVKKEKLNMANTLIIWIVPLAKRVDKFWQFMQNFREMCIEQDGRVHLTVW 300 

Query: 301 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

FGKEE I NEVKG I LENTSKAANFRMFT F I QLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 
Sbjct: 301 FGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVD 360 

Query: 361 lYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRD 420 

I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KKETGFWRD 
Sbjct: 361 I YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGII YGHHDAVPPLEQQLVI KKETGFWRD 420 

Query: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHL 453 

FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHL 
Sbjct: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHL 453 



LJ>9i |34e77724|ref |XP 224 7S 7.2| similar to chondroitin betal,4 N-acetylgalactosa 

[Rattus norvegicusi 
Length = 617 

Score = 828 bits (2139), Expect =0.0 

Identities = 395/448 (88%) , Positives = 417/448 (93%) 

Query: 3 ^WRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ 62 

MVRRGL+ WISRW+LLVLLCCAISVLYMLACTPKGD+EQL LPRAN PTGK+GYQAVLQ 
Sbjct: 1 MVRRGLVGWISRWILLVLLCCAISVLYMLACTPKGDQEQLGLPRANGPTGKDGYQAVLQ 60 
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RID=1088011356-16594-155894407553.BLASTQ4, 



http://www.iicbi.nlninih.gov/BLAST/Blast.cgi 



Query: 63 EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLA 122 

E EEQHRNYV+SLKRQIAQLK+ELQ RSEQLR+GQ QASDA L P K QAD+LA 

Sbjct: 61 EREEQHRNYVNSLKRQIAQLKDELQARSEQLRSGQDQASDATSLRSGWDPEPKAQADILA 120 

Query: 123 FLHSQVDKAEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVE 182 

FL QVDKAEV+AGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVE 
Sbjct: 121 FLRGQVDKAEVHAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVE 180 

Query: 183 AIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILF 242 

AIESALE+LN+P E+SP+ RPYTA+DFIEGIYRTERDKGTLYELTFKGDHKHEF+RL+LF 
Sbjct: 181 AIESALESLNSPVESSPHQRPYTAADFIEGIYRTERDKGTLYELTFKGDHKHEFQRLVLF 240 

Query: 243 RPFSPIMKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVW^^ 302 

RPF PIMKVK EKLNMANTLINVIVPLA+RVDKFR FMQNFREMCI +QDGRVHLTVVYFG 
Sbjct: 241 RPFGPIMKVKKEKLNMANTLINVIVPLARRVDKFRHFMQNFREMCIQQDGRVHLTVVYFG 300 

Query: 303 KEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIY 362 

KEE+NEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSMV+LFFCDVDIY 
Sbjct: 301 KEEMNEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNWLFFCDVDIY 360 

Query: 363 FTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFG 422 

FTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI I YGHHDAVP LEQQLVIKKETGFWRDFG 
Sbjct: 361 FTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPALEQQLVIKKETGFWRDFG 420 

Query: 423 FGMTCQYRSDFINIGGFDLDIKGWGGED 450 

FGMTCQYRSDFINIG F + + GED 
Sbjct: 421 FGMTCQYRSDFINIGVFLVTLDLMQGED 448 

Score = 201 bits (510), Expect = 5e-50 
Identities = 90/97 (92%), Positives = 93/97 (95%) 

Query: 436 IGGFDLDIKGWGGEDVHLYRKYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQ 495 

+ GFDLDIKGWGGEDVHLYRKYLHSNLIV+RTPVRGLFHLWHEK CMDELTPEQY+MCMQ 
Sbjct: 521 VRGFDLDIKGWGGEDVHLYRKYLHSNLIVIRTPVRGLFHLWHEKHCMDELTPEQYRMCMQ 580 

Query: 4 96 SKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSKKT 532 

SKAMNEASHGQLGMLVFRHEIEAHLRKQK K SSKKT 
Sbjct: 581 SKAMNEASHGQLGMLVFRHEIEAHLRKQKLKASSKKT 617 

Li >9i I 47219910 I ervh \ CAF97130 . 1 1 unnamed protein product [Tetraodon nigroviridis] 
Length = 607'* 

Score 720 bits (1858), Expect = 0.0 

Identities = 368/609 (60%), Positives = 447/609 (73%), Gaps = 83/609 (13%) 

Query: 3 MVRRGLLAWISRVWLLVLLCCAISVLYMLACTP-KGDEEQLA- -LPRA 48 

M+RRGLLAW+SRV +LVLLC ++S+LY++ C+P D L LPRA 
Sbjct: 1 MLRRGLLAWVSRVGGVLVLLCSSLSLLYLMTCSPPHSDNHPLGHVLPRAAPVRPSLGGTG 60 

Query: 49 --NSPTGK EGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLR 94 

+ T K + YQ +LQE EEQHR ++SSLK+QIAQLKE LQERS+QL+ 

Sbjct: 61 AGDVGTAKARAAQNGAAPPAQSYQVLLQEREEQHRLHISSLKQQIAQLKEALQERSQQLK 120 

Query: 95 NGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQ 154 

Q S GLG + P+ QADL +L SQ+ KAEV AG +L +EYA VPF+SFTLQ 
Sbjct: 121 G--VQESIKTGLG-EAHGPKSQQADLQDYLRSQLTKAEVTAGTRLPSEYAWPFESFTLQ 177- 

Query: 155 KVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLN NPAENSPNHRPYT 205 

+VYQLETGLTRHPEEKPVRKD+RDEL E +E+AL +LN PA+ +P + Y+ 

Sbjct: 178 RVYQLETGLTRHPEEKPVRKDRRDELGEVLETALHSLNAPSAQQDDKTPAQKAPTSKVYS 237 

Query: 206 ASDFIE GIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKV 251 

SDF+E GI RTE+DKGT+YEL F+G+ +EF+RL+LFRPF P+MKV 

Sbjct: 238 PSDFVEDSSLSSWHPGAFSGISRTEKDKGTMYELIFRGEAANEFRRLVLFRPFGPLMKV 297 
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RII>1088011356-16594-155894407553.BLASTQ4, 



http://ww.iicbi.nlrniiih.gov/BLAST/Blast.cgi 



Query: 


252 


Sbjct : 


298 


Query : 


312 


Sbjct: 


358 


Query : 


372 


Sb j Ct : 


418 


Query : 


403 


Sbjct : 


478 


Query : 


463 


Sbjct : 


538 


Query : 


523 


Sbjct: 


598 



++E+++ A+ IN++VPL++R DKF+QFM NFRE+C+ QDGRVHLTWYFGKE+++EV+ 



LENTS+ F+N+T +QL+ EFSRG+GLDVGAR W+G NVLLFFCDVDIYFT++FLN C 



RLNTQPG KKVFYPVLFSQYNPGI I YGHHDAV 402 

RLN QPG KKVFYPVLFSQYNP +IYG + V 



PP+EQQLVIKK+TGFWRDFGFGMTCQYRSDFINIGGFD+DIKGWGGEDVHLYRKYLHSNL 



+WRTP RGLFHLWHEK C DEL P+QY+MCMQSKAMNEASHGQLGML FRHEIEAHLRK 



QKQ+ ++ K 



^ >g-'i I 244.29592 | ref | NP_061060 . 3 | chondroitin betal, 4 N-acetylgalactosaminyltransf 

sapiens] 

gi ! 20988438 i gb j AAH3 026 8 . 1 \ Chondroitin betal, 4 N-acetylgalactosaminyltransf erase 

sapiens] 

beta- 1 , 4-N-acethylgalactosaminyltransf erase [Homo s^ 
chondroitin betal, 4 N-acetylgalactosaminyltransf eraj 





27922994 idbj 


BACF;593 5, 1 j 


gi 


27923015|dbj 


B.AC55936,l| 



sapiens] 
Length = 542 

Score = 665 bits (1717), Expect = 0.0 

Identities = 316/512 (61%), Positives = 396/512 (77%), Gaps = 11/512 (2%) 



+Y+L C P+ D GKE YQA+LQE EE ++ +SLKRQIAQLK+ELQE 



SE++R+ Q + + A G+G +S E+ +DLL FLHSQ+DKAEV+ G KL +EY +P 



F+SFTL KV+QLE GLTRHPEEKPVRKDKRDELVE IE+ LE +NNP E+ 



- YTASDF I EG I YRTERDKGTLYELTFKGDHKHEFKRL I LFRPFS P I MKVKNEKLNM 258 
+ +DF+EG YRTERDKGT YEL FK E++ + LFRPF P+MKVK+E +++ 



Query : 


29 


Sbjct: 


29 


Query : 


89 


Sbjct: 


89 


Query : 


148 


Sbjct : 


148 


Query: 


204 


Sbjct: 


208 


Query : 


259 


Sbjct : 


268 


Query : 


319 


Sbjct: 


328 


Query: 


379 


Sbjct: 


388 



++IN+IVPLA+R + F QFMQNFR++CI QD ++HLTVVYFGKE +++VK ILE+ + 



+NF N+T + LN EF+RG+GL+VGAR W VL+FFCDVDIYF++EFLN4-CRLN +PG 



KKVFYPV+FS YNP I+Y + + PP+EQQLV KK++GFWRDFGFGMTCQYRSDF+ IGG 
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RE)=1 0880 11 356-1 6594-1 55894407553.BLASTQ4, 



http://www.ncbi.iilmnih.gov/BLAST/Blast.cgi 



Query: 439 FDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKA 4 98 

FD+++KGWGGEDVHLYRICYLH +LIV+RTPV GLFHLWHEKRC DELTPEQY+MC+QSKA 
Sbjct: 448 FDMEVKGWGGEDVHLYRKYLHGDLIVIRTPVPGLFHLWHEKRCADELTPEQYRMCIQSKA 507 

Query: 499 MNEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

MNEASH LGMLVFR EIE HL KQ +T+S+ 
Sbjct: 508 MNEASHSHLGMLVFREEIETHLHKQAYRTNSE 539 



U > gi I 3X560032 | ref | NP Q84441 . 3 | chondroitin sulfate GalNAcT-2 [Mus musculus] 

gi I 26389329 j dbj iBAC25717. 1 j H unnamed protein product [Mus musculus] 
Length = 542 

Score = 659 bits (1701), Expect = 0.0 

Identities = 315/514 (61%), Positives = 394/514 (76%), Gaps = 15/514 (2%) 

Query: 29 LYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQE 88 

+Y+L C P+ D GKE YQA+LQE EE ++ +SLKRQIAQLK+ELQ+ 

Sbjct: 29 MYLLECAPQTDGNASLPGWRENYGKEYYQALLQEQEEHYQTRATSLKRQIAQLKQELQD 88 

Query: 89 RSEQLRNGQYQAS-DAAGLGLDRSPPEKTQA- -DLLAFLHSQVDKAEVNAGVKLATEYAA 145 

SE++R Q + A G+G P + QA DLL FLHSQ+D+AEV+ G KL +EY 
Sbjct: 89 MSEKMRALQERKKLGANGVGY---PGNREQAPSDLLEFLHSQIDRAEVSVGAKLPSEYGV 145 

Query: 146 VPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPAENSPNHRP- - 203 

VPF+SFTL KV+QLE GLTRHPEEKPVRKDKRDELVE IE+ +E +NNP E+ 
Sbjct: 146 VPFESFTLMKVFQLEMGLTRHPEEKPVRKDKRDELVEVIEAGVEVINNPDEDDAQEDEEG 205 

Query: 204 YTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKL 256 

+ +DFIEG YRTERDKGT YEL FK E+-1- + LFRPF P+MKVKNE + 

Sbjct: 206 PLGEKLIFNENDFIEGYYRTERDKGTQYELFFKKADLMEYRHVTLFRPFGPLMKVKNE 265 

Query: 257 NMANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENT 316 

++ ++IN+IVPLA+R + F QFMQNFR++CI QD R+HLTWYFGKE +++VK ILE+ 
Sbjct: 266 DITRSVINIIVPLAERTEAFSQFMQNFRDVCIHQDKRIHLTWYFGKEGLSKVKSILESV 325 

Query: 317 SKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQ 376 

S ++F N+T + L+ EF+RG+GL+VGAR W VL+FFCDVDIYF++EFLN+CRLN + 
Sbjct: 326 SSESDFHNYTLVSLDEEFNRGRGLNVGARAWDKGEVLMFFCDVDIYFSAEFLNSCRLNAE 385 

Query: 377 PGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINI 436 

PGKKVFYPV+FS YNP I+Y + D PP+EQQLV KK++GFWRDFGFGMTCQY+SDF+++ 
Sbjct: 386 PGKKVFYPWFSLYNPAIVYANQDVPPPVEQQLVHKKDSGFWRDFGFGMTCQYQSDFLSV 445 

Query: 437 GGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 4 96 

GGFD+++KGWGGEDVHLYRKYLH +LIV+RTPV GLFHLWHEK C DELTPEQY+MC+QS 
Sbjct: 446 GGFDMEVKGWGGEDVHLYRKYLHGDLIVIRTPVPGLFHLWHEKHCADELTPEQYRMCIQS 505 

Query: 497 KAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

KAMNEASH LGM+VFR EIE HLRKQ +T+S+ 
Sbjct: 506 KAMNEASHSHLGMMVFREEIEMHLRKQAYRTNSE 539 



LJ >gi I 263 36 533 | d bj | 3A3317 61_^2 J unnamed protein product [Mus musculus] 

Length = 542 

Score 657 bits (1696), Expect =^0.0 

Identities = 314/514 (61%), Positives = 394/514 (76%),. Gaps = 15/514 (2%) 

Query: 29 LYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQE 88 

+Y+L C P+ D GKE YQA+LQE EE ++ +SLKRQIAQLK+ELQ+ 

Sbjct: 29 MYLLECAPQTDGNASLPGWRENYGKEYYQALLQEQEEHYQTRATSLKRQIAQLKQELQD 88 

Query: 89 RSEQLRNGQYQAS-DAAGLGLDRSPPEKTQA- -DLLAFLHSQVDKAEVNAGVKLATEYAA 145 

SE++R Q + A G+G P + QA DLL FLHSQ+D+AEV+ G KL +EY 
Sb j Ct : 89 MSEKMRALQERKKLGANGVGY- - - PGNREQAPSDLLEFLHSQIDRAEVSVGAKLPSEYGV 145 
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Query: 146 VPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPAENSPNHRP- - 203 

VPF+SFTL KV+QLE GLTRHPEEKPVRKDKRDELVE IE+ +E +NNP E+ 
Sbjct: 146 VPFESFTLMKVFQLEMGLTRHPEEKPVRKDKRDELVEVIEAGVEVINNPDEDDAQEDEEG 205 

Query: 204 YTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKL 256 

+ +DFIEG YRTERDKGT YEL FK E++ + LFRPF P+MKVKNE + 

Sbjct: 206 PLGEKLIFNENDFIEGYYRTERDKGTQYELFFKKADLMEYRHVTLFRPFGPLMKVKNELI 265 

Query: 257 NMANTLIWIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTVWFGKEEINEVK^ 316 

++ + + IN+IVPLA+R + F QFMQNFR++CI QD R+HLTWYFGKE +++VK ILE+ 
Sbjct: 266 DITRSVINIIVPLAERTEAFSQFMQNFRDVCIHQDKRIHLTWYFGKEGLSKVKSILESV 325 

Query: 317 SKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQ 376 

S ++F N+T + L+ EF+RG+GL+VGAR W VL+FFCDVDIYF++EFLN+CRLN + 
Sbjct: 326 SSESDFHNYTLVSLDEEFNRGRGLNVGARAWDKGEVLMFFCDVDIYFSAEFLNSCRLNAE 385 

Query: 377 PGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINI 436 

PGKKVFYPV+F+ YNP I+Y + D PP+EQQLV KK++GFWRDFGFGMTCQY+SDF+++ 
Sbjct: 386 PGKKVFYPWFNLYNPAIVYANQDVPPPVEQQLVHKKDSGFWRDFGFGMTCQYQSDFLSV 445 

Query: 437 GGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 496 

GGFD+++KGWGGEDVHLYRKYLH +LIV+RTPV GLFHLWHEK C DELTPEQY+MC+QS 
Sbjct: 446 GGFDMEVKGWGGEDVHLYRKYLHGDLIVIRTPVPGLFHLWHEKHCADELTPEQYRMCIQS 505 

Query: 497 KAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

KAMNEASH LGM+VFR EIE HLRKQ +T+S+ 
Sbjct: 506 KAMNEASHSHLGMMVFREEIEMHLRKQAYRTNSE 539 

LJ >gi \3J^^i:2!?.2-?.} ^^P\ jj.:..-!:.J unnamed protein product [Tetraodon nigroviridis] 

'"Length = 496 

Score = 577 bits (1487), Expect = e-163 

Identities = 284/494 (57%), Positives = 370/494 (74%), Gaps = 28/494 (5%) 

Query: 52 TGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGL- -GLD 109 

T +EGY A+LQE E+ HR Y++SL RQI QLKE L ER++QL++ +A L GL+ 

Sbjct: 3 TSREGYMALLQEREDSHRRYINSLTRQIGQLKEALLERTQQLQDSLEKAQTGGILPGGLE 62 

Query: 110 RSPPEKTQADLLAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQK 155 

TQ+DL FL SQ+D+AEVN+GVKL+ EY +P+D+FTLQ+ 
Sbjct: 63 SLRKTPTQSDLKEFLRSQLDRAEVNSGVKLSGEYEVIPYDTFTLQRWVTELPKPSPMWLH 122 

Query: 156 VYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPAEN SPNHRPY 204 

VYQLETGL+RHPEE+PVR+D+R EL A+E+AL LN P ++ + + 
Sbjct: 123 REALLSPRVYQLETGLSRHPEERPVRRDRRAELTGAVETALHLLNGPEQHGDGAAGKHTH 182 

Query: 2 05 TASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMANTLIN 264 

+ +DF+EG+ RTERD+GT+YELTFK +L+LFRPF P++KV++E +++++ L-fN 

Sbjct: 183 SPADFVEGLTRTERDRGTVYELTFKDKGPRGLSQLVLFRPFGPLLKVRSESVDLSSVLVN 242 

Query: 265 VIVPLAPCRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRN 324 

V+VPL+ R + FR+F+ NFR CI+QDGRVHLTW+FG + + +VK +L+ TS+ FRN 
Sbjct: 243 VWPLSGRTEAFRRFIANFRRACIQQDGRVHLTWHFGGDPVEQVKALLDQTSRETRFRN 302 

Query: 325 FTFIQLNGEFSRGKGLDVGARFWKGS-NVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFY 383 

FT IQLN FSRG+GL++GAR W+ S NVLLFFCDVD++FT++FL +CRLN PGKKV+Y 
Sbjct: 303 FTLIQLNEVFSRGRGLEIGARAWRRSQNVLLFFCDVDVHFTADFLTSCRLNALPGKKVYY 362 

Query: 3 84 PVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDI 443 

PVLFS YNP +IY +H P L+QQLVI +KE+GFWRDFGFGMTCQYRSDFINIGGFD I 
Sbjct: 363 PVLFSLYNPSVIYHNHTHPPSLQQQLVIRKESGFWRDFGFGMTCQYRSDFINIGGFDRAI 422 

Query: 444 KGWGGEDVHLYRKYLHSNLIVVRTPWGLFHLWHEKRCMDELTPEQYKMCMQSKAMNE^ 503 

+GWG EDVHLYRKYLHS L+WR+P R LFHLWHEK C DEL P++Y+MCMQ+KA++EAS 
Sbjct: 423 RGWGLEDVHLYRKYLHSKLMVWSPSRSLFHLWHEKVCADELPPDKYRMCMQTKAVS^^ 482 
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Query: 504 HGQLGMLVFRHEIE 517 

HG+LG LVFR E 
Sbjct: 483 HGRLGELVFRRRDE 496 



J > gi I 38174240 I gb I AAI46Q772 . 1 1 ChGn protein [Homo sapiens] 

Length = 297 

Score = 557 bits (1436), Expect = e-157 

Identities = 282/284 (99%), Positives = 283/284 (99%) 

Query: 1 MM^WRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRAN 60 

MMMVRRGLLAW I SRVWLLVLLCCAI SVLYMLACTPKGDEEQLALPRANS PTGKEGYQAV 
Sbjct: 1 MMMVRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAV 60 

Query: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 120 

LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 
Sbjct: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADL 120 

Query: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFICRLI 
Sbjct: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

Query: 241 LFRPFSPIMKVKNEKLNMANTLINVIVPLAKRVDKFRQFMQNFR 284 

LFRPF PIMKV-fNEKLNMANTLINVIVPLAICRVDKFRQFMQNFR 
Sbjct: 241 LFRPFGPIMCTENEKLNMANTLINVIVPLAKRVDKFRQFMQNFR 284 



LJ > gi I 2634 04 98 I dbj | BAC33 912 . 1 1 unnamed protein product [Mus musculus] 

Length =300 

Score = 532 bits (1370) , Expect = e-150 

Identities = 262/302 (86% ),. Positives = 279/302 (92%), Gaps = 4/302 (1%) 

Query: 3 MVRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ 62 

MVRRGLL WISRW+LLVLLCCAISVLYMLACTPKGD+EQL LPRAN PTGK+GYQAVLQ 
Sbjct: 1 MVRRGLLGWISRWILLVLLCCAISVLYMLACTPKGDQEQLGLPRANGPTGKDGYQAVLQ 60 

Query: 63 EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGL- -GLDRSPPEKTQADL 120 

E EEQHRNYV+SLKRQIAQLK+ELQ RSEQ R+GQ QASDA L G DR P K QADL 
Sbjct: 61 EREEQHRNYVNSLKRQIAQLKDELQARSEQFRSGQDQASDATSLRSGWDREP--KAQADL 118 

Query: 121 LAFLHSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 180 

LAFL QVDKAEV+AGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 
Sbjct: 119 LAFLRGQVDKAEVHAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDEL 178 

Query: 181 VEAIESALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLI 240 

VEAIESALE+LN+P E+SP+ RPYTA+DFIEGIYRTERDKGTLYELTFKGDHKHEF+RL+ 
Sbjct: 179 VEAIESALESLNSPVESSPHQRPYTAADFIEGIYRTERDKGTLYELTFKGDHKHEFQRLV 238 

Query: 241 LFRPFSPIMKVKNEKIJ^MANTLIWIVPIAKRVD^ 300 

LFRPF PIMKVK EKLN+ANTLINVIVPLA+RVDKFR FMQNFREMCI+QDGRVHLTWY 
Sbjct: 239 LFRPFGPIMKVKKEKLNLANTLINVIVPLARRVDKFRHFMQNFREMCIQQDGRVHLTVVY 298 

Query: 3 01 FG 302 
FG 

Sbjct: 299 FG 300 

LJ > gi I 7023S13 I dbj \ aAA92093 . 1 1 unnamed protein product [Homo sapiens] 

Length = 275 

Score = 523 bits (1347), Expect = e-147 
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Identities = 238/239 (99%), Positives = 239/239 (100%) 

Query: 283 FREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDV 342 

FREMCIEQDGRVHLTWYFGKEEINEVKG+LENTSKAANFRNFTFIQLNGEFSRGKGLDV 
Sbjct: 26 FREMCIEQDGRVHLTWYFGKEEINEVKGVLENTSKAANFRNFTFIQLNGEFSRGKGLDV 85 

Query: 343 GTU^FWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAV 402 

GARFWKGSIWLLFFCDVDIYFTSEFIJJTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAV 
Sbjct: 86 GARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAV 145 

Query: 403 PPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNL 462 

PPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNL 
Sbjct: 146 PPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNL 205 

Query: 463 IWRTPVRGLFHLIffiEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 

I WRT PVRGLFHLWHEKRCMDELTPEQYKMCMQS KAMNEASHGQLGMLVFRHE I EAHLR 
Sbjct: 206 IWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 264 



O >gi 1 20072996 | gb | AAH26599 . 1 1 473243 5N03Rik protein [Mus tnusculus] 

Length = 239 

Score = 494 bits (1272) , Expect = e-138 

Identities = 224/228 (98%), Positives = 226/228 (99%) 

Query: 294 VHLTWYFGKEEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVL 353 

VHLTWYFGKEE+NEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVL 
Sbjct: 1 VHLTWYFGKEEMNEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVL 60 

Query: 354 LFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKK 413 

LFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPG+IYGHHDAVPPL QQLVIKK 
Sb j c t : 61 LFFCDVDI YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGVI YGHHDAVPPLGQQLVI KK 120 

Query: 414 ETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLF 473 

ETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLF 
Sbjct: 121 ETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLF 180 

Query: 474 HLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 521 

HLWHEK CMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 
Sbjct: 181 HLWHEKHCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLR 228 



LJ> gi|3485S369|ref |XP 232316>2| similar to chondroitin betal,4 N-acetylgalactosa 

[Rattus norvegicus] 
Length - 462 

Score = 486 bits (1251), Expect = e-136 

Identities - 214/317 (67%), Positives = 266/317 (83%) 

Query: 214 YRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVI<NEKLNMANTLINVIVPLAKRV 273 

YRTERDKGT YEL FK , E++ + LFRPF P+MKVK+E +++ ++IN+IVPLA+R 
Sbjct: 143 YRTERDKGTHYELFFKKADLMEYRHVTLFRPFGPLMKVKSELIDITRSVINIIVPLAERT 202 

Query: 274 DKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNFTFIQLNGE 333 

+ F QFMQNFR++CI QD R+HLTWYFGKE ++ VK ILE+ S +NF N+T + LN E 
Sbjct: 203 EAFSQFMQNFRDVCIHQDKRIHLTWYFGKEGLSTVKSILESVSSESNFHNYTLVSLNEE 262 

Query: 334 FSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPG 393 

F+RG+GL+VGAR W VL+FFCDVPIYF++EFLN-I-CRLN +PGKKVFYPV+FS YNP 
Sbjct: 263 FNRGRGLNVGARTWDKGEVLMFFCDVDIYFSAEFLNSCRLNAEPGKKVPYPWFSLYNPA 322 

Query: 394 IIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHL 453 

I+Y + + PP+EQQLV KK++GFWRDFGFGMTCQY+SDF-f++GGFD+++KGWGGEDVHL 
Sbjct: 323 IVYANQEVPPPVEQQLVHKKDSGFWRDFGFGMTCQYQSDFLSVGGFDMEVKGWGGEDVHL 382 

Query: 4 54 YRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFR 513 
YRKYLH +LIV+RTPV GLFHLWHEK C DELTPEQY+MC+QSKAMNEASH LGM+VFR 
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Sbjct: 383 YRKYLHGDLIVIRTPVPGLFHLWHEKHCADELTPEQYiy^CIQSK?UvnsrEASHSHLGM^ 442 

Query: 514 HEIEAHLRKQKQKTSSK 530 

EIE HLRKQ +T+S+ 
Sbjct: 443 EEIEMHLRKQAYRTNSE 459 



Score = 99.8 bits (247), Expect = 2e-19 

Identities = 63/144 (43%), Positives = 87/144 (60%), Gaps = 4/144 (2%) 

Query: 3 IWRRG- -LLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAV 60 

M RRG L + ++V L LL + +Y+L C P+ D GKE YQA+ 

Sbjct: 1 MSRRGPILHSRTQWLLVGLALLFSLVLFMYLLECAPQTDGNASLPGWRENYGKEYYQAL 60 

Query: 61 LQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQAS-DAAGLGLDRSPPEKTQAD 119 

LQE EE ++ +SLKRQIAQLK+ELQ+ SE++R Q + A G+G + E+T +D 
Sbjct: 61 LQEQEEHYQTRATSLKRQIAQLKQELQDMSEKMRALQERKKLGANGIGY-QGNREQTPSD 119 

Query: 120 LLAFLHSQVDKAEVNAGVKLATEY 143 

LL FLHSQ+D+AEV+ G KL + Y 
Sbjct: 120 LLEFLHSQIDRAEVSIGAKLPSYY 143 

Lj >gi I 47216537 | eiT^ | CAG04715 . 1 1 unnamed protein product [Tetraodon nigroviridis] 
Length = 522 

Score = 485 bits (1248), Expect = e-135 

Identities = 235/413 (56%), Positives = 298/413 (72%), Gaps = 8/413 (1%) 

CTPKGDEEQLALPRANS PTGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQERSEQL 93 
CTP D . + A GKE YQA+LQE EE+H N +SLICRQIAQLK+ELQE S++L 



+ Q + A GL + ++ DLL +LHSQ+DKAEVN G + +EYA +PF+SFT 



KVYQLE GLTRHPEEKPVR+D+RDELVE +E+AL+ +NNP E+ P R YT 



F EG+YRTERDKGTLYEL F + F+ + LFRPF P+MKV++ + + +IN+I 



VPL+ RV+ F QF+ NFRE+CI D RVHLTWYFG+E + EVK LE S+ +F N+T 



I ++ EFSRG+GLD+GA WK +VL+FFCDVDI+F+ EFLNTCRL+ P K+VFYPV+ 



FS YNP I+YG+ + PP+E QL+ KK+ GFWRDFGFGMTCQYRSDF+NI F 
FSLYNPAIVYGNLELAPPIELQLIHKKDAGFWRDFGFGMTCQYRSDFLNIASF 445 

>gi I 31198419 I ref | XF_3081S7 . 1 1 ENSANGPOOOO 002 1559 [Anopheles gatnbiae] 
gi ! 21292063 j gb j RfiA04 2 087'rr"'' ENSANGP00000021559 [Anopheles gambiae str. PEST] 
Length =648 

Score - 382 bits (982), Expect = e-105 

Identities = 222/547 (40%), Positives = 310/547 (56%), Gaps = 68/547 (12%) 

Query: 47 RANSPTGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQA 100 

+A++ + GY +LQ+ EE++ V+ L +1 LK ++ QL+NG A 



Query: 


34 


Sbjct: 


34 


Query: 


94 


Sbjct: 


94 


Query : 


154 


Sbjct: 


153 


Query : 


207 


Sbjct: 


213 


Query: 


267 


Sbjct: 


273 


Query : 


327 


Sbjct: 


333 


Query : 


387 


Sbjct: 


393 
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Sbjct: 81 QADAEQQQNGYMQMLQQREEENLKEVAKLTAEIKALKLQIL QLKNGLTNAGMGMVQ 136 

Query: 101 SDAAGLGLDRSP PEKTQA- -DLLAFLHSQVDKAEVNAGVKLATEYAAVPFD 149 

A L D S P+ +Q D AF+ QV AE+ G+ L EY +PF+ 

Sbjct: 137 PNWEAAVSLSNDSSAIVANAPQTSQMLHDCTAFIRRQVGSAEILHGLPLNNEYELIPFN 196 

Query: 150 SFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPAENSPNHRPYTASDF 209 

FT +VY +E GL + EKP+ KR +++ A+ ALETLN A +S YT DF 

Sbjct: 197 HFTFSRVYPIELGLGKRVVEKPIGY-KRKDILSALNKALETLNRNASSSAQR--YTLDDF 253 

Query: 210 lEGIYRTERDKGTLYELTFKGDHK HEFKRLILFRPFSPIMKVKN 253 

lEGIYR E GT YEL F+ H +LI+ RPF+ + V+ 

Sbjct: 254 lEGIYRNEPTTGTQYELYFRTKESANRSQQQQQIAQHHESHGTTKLIVMRPFASLQTVQL 313 

Query: 254 EKI1N--MANTLIWIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKG 311 

E + +1 +I+PL+ R F+ FM+ + ++ ++ D RVHLTWYFG+E + E + 

Sbjct: 314 EAYSKHQEKEIIYIILPLSGRTSTFQSFMEKYVKIALKHDKRVHLTWYFGEEGLAEART 373 

Query: 312 ILENT SKAANFRNFTFIQLNGEFSRGKGLDVGAR-FW KGSNVLLFFCDVDI 361 

1+ + A N + LN FSR K L VGA W K +++LLF CDVDI 

Sbjct: 374 IMSRVIGMIOSfSGATNSNLKLIALNETFSRAKALRVGAENWSSQADKNNDILLFM 433 

Query: 362 YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI lYG-HHDAVPPLEQQLVIKKETGFWRD 420 

F+++FL+ CR NT+P KKV+YPV+FS YNP ++Y VPP QLVI K++GFWRD 

Sbjct: 434 VFSAKFLDRCRWNTKPNKKVYYPWFSLYNPHWYTLQGKDVPPETDQLVISKDSGFWRD 4 93 

Query: 421 FGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKR 480 

FG+GMTCQYRSDF+ + GFD +1 GWGGEDV LYRKY+ S++ V+R G+FH+WH K 
Sbj ct : 494 FGYGMTCQYRSDFLRVRGFDEEI IGWGGEDVMLYRKYVRSHIKVIRATDPGVFHIWHPKV 553 

Query: 481 CM DELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL RKQKQ 525 

C LT +QY+ C++S+A+NEASH QLG L FR +1 A+ K Q 

Sbjct: 554 CTGPVMSVTSNQRLTLDQYRACIRSRALNEASHAQLGFLAFRDDIAANEYILAQGAKLNQ 613 

Query: 526 KTSSKKT 532 

++S+ KT 
Sbjct: 614 ESSTTKT 620 

Li >gi I 34365000 I eiT^) | CAS46036 . 1 1 hypothetical protein [Homo sapiens] 
Length = 216 

Score = 361 bits (927), Expect = 2e-98 

Identities = 154/211 (72%) , Positives = 183/211 (86%) 

Query: 320 ANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGK 3 79 

+NF N+T + LN EF+RG+GL+VGAR W VL+FFCDVDIYF++EFLN+CRLN +PGK 
Sbjct : 3 SNFHNYTLVSLNEEFNRGRGLNVGARAWDKGEVLMFFCDVDIYFSAEFLNSCRLNAEPGK 62 

Query: 380 KVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGP 439 

KVFYPV+FS YNP I+Y + + PP+EQQLV KK++GFWRDFGFGMTCQYRSDF+ IGGF 
Sbjct: 63 KVFYPWFSLYNPAIVYANQEVPPPVEQQLVHKKDSGFWRDFGFGMTCQYRSDFLTIGGF 122 

Query: 440 DLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAM 499 

D+++KGWGGEDVHLYRKYLH +LIV+RTPV GLFHLWHEKRC DELTPEQY+MC+QSKAM 
Sbjct: 123 DMEVKGWGGEDVHLYRKYLHGDLIVIRTPVSGLFHLWHEKRCADELTPEQYRMCIQSKAM 182 

Query: 500 NEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

NEASH LGMLVFR EIE HL KQ +T+S+ 
Sbjct: 183 NEASHSHLGMLVFREEIETHLHKQAYRTNSE 213 

O >gi I 24652379 | ref | NF_61QS67 . 1 1 CG12913-PA [Drosophila melanogaster] 

aiJ.215122lLa]2l^:MMQll:^^^^^^ CG12913-PA [Drosophila melanogaster] 

Leng th = 54 6 

Score ^ 351 bits (901) , Expect. 2e-95 
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Identities = 208/519 (40%), Positives = 293/519 (56%), Gaps = 59/519 (11%) 

Query: 39 DEEQLALPRANSPTGKEGYQAVLQEWE EQHRNYVSSLKRQIAQLKEELQERSEQLRN 95 

DE +AN+P+ G+ LEE E+ + V +L+ QI L+ + + N 

Sbjct: 34 DELTTRETQANNPSESSGFSYTLSEREAEIERLKQEVLALRTQILFLQN--NRSTAKPSN 91 

Query: 96 GQYQASDAAGLGLDRSPPEKT QADLLAFLHSQVDKAEVNAGVKLATEYAAVPFDSFT 152 

G Q + PP D +++ QV AE+ G+ L EY +P++ FT 

Sbjct: 92 GSLQLQETTA GPPTAPLGHHYDCSSYIRKQVGAAEILHGLPLNNEYELIPYNHFT 146 

Query: 153 LQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPAENSPNHRP 203 

+VY ++ GL + EKP+ +RD L-fEA+ ALE+LN +S R 
Sbjct: 147 FTRVYPIDLGLGKRWEKPIGYRRRD-LIEAVNKALESLNR--NHSARIRAKGAGSAAAY 203 

Query: 204 YTASDFIEGIYRTERDKGTLYELTFKGDHKHE- - -FKRLILFRPFSPIMKVKNE 254 

YT DFIEGIYR E GT YEL F+ KH+ +R ++ RPF-I-P+ V+ 
Sbjct: 204 ASDVIKYTLDDFIEGIYRNEPTTGTQYELYFQ-SVKHQASPVRRALVMRPFAPLQTVQLS 262 

Query: 255 KLNMA NTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGK 3 03 

+L+ + +I+VI+PLA R+ ■ FR F+Q F ++ +D R+ L WYFG 
Sbjct: 263 ELSSSVDNSGAPPSHSPPIIHVILPLAGRLHSFRGFLQMFAKL EDRRLELIWYFG- 318 

Query: 304 EEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGS- -NVLLFFCDVDI 361 

G+ + S A + F+ LN FSR K L +GA + + +VLLF CDVDI 
Sbjct: 319 TSGLEQARSLAGRSQRTQFLALNETFSRAKALRLGAEHIQPAEEDVLLFMCDVDI 373 

Query: 362 YFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYG-HHDAVPPLEQQLVIKKETGFWRD 420 

FT++FL CR N PGKKV+YPV+FS YNP ++Y +P E+QLVI ++TGFWRD 

Sbjct: 374 MFTTKFLERCRWNAAPGKKVYYPVVFSLYNPHVVYSLQGKPLPSEEEQLVISRDTGFWRD 433 

Query: 421 FGFGMTCQYRSDFINIGGFDLD-IKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEK 479 

FG+GMTCQYRS+F+ + GFD + I GWGGEDV LYRKY+ S + ++R G+FH WH K 
Sbjct: 434 FGYGMTCQYRSNFLKVRGFDEEEIVGWGGEDVMLYRKYVRSKIKIIRATDPGIFHRWHTK 493 

Query: 480 RCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEA 518 

C LT +QY+ C++S+A+NEASH QLG L FR +1 A 
Sbjct: 4 94 ICSSSLTADQYRACIRSRALNEASHAQLGFLAFRDDIAA 532 

U >gi 1 11360289 |pir | | T46919 hypothetical protein DKFZp762L037 . 1 - human (fragment) 
9i i 7018526 I embj CAB75673 . 1 j hypothetical protein [Homo sapiens] 

Length =""l52 

Score = 317 bits (812), Expect = 5e-85 

Identities = 141/141 (100%), Positives = 141/141 (100%) 

Query: 381 VFYPVLFSQYNPGI lYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFD 440 

VFYPVLFSQYNPGII YGHHDAVPPLEQQLVI KKETGFWRDFGFGMTCQYRSDFINIGGFD 
Sbjct : 1 VFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFD 60 

Query: 441 LDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMN 500 

LDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMN 
Sb j C t : 61 LDI KGWGGEDVHLYRKYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMN 12 0 

Query: 501 EASHGQLGMLVFRHEIEAHLR 521 

EASHGQLGMLVFRHEIEAHLR 
Sbjct: 121 EASHGQLGMLVFRHEIEAHLR 141 

LJ >gi 1 18677033 | dbj | HA335092 . 1 1 unnamed protein product [Homo sapiens] 

Length = iso 

Score = 317 bits (811) , Expect = 5e-85 

Identities = 134/177 (75%), Positives = 158/177 (89%) 

Query: 354 LFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KK 413 
+FFCDVDIYF++EFLN+CRLN +PGKKVFYPV+FS YNP I+Y + + PP+EQQLV KK 
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Sbjct: 1 MFFCDVDIYFSAEFLNSCRLNAEPGKKVFYPWFSLYNPAIVYANQEVPPPVEQQLVHKK 60 

Query: 414 ETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLF 473 

++GFWRDFGFGMTCQ+RSDF+ IGGFD+++KGWGGEDVHLYRKYLH +LIV+RTPV GLF 
Sbjct: 61 DSGFWRDFGFGMTCQHRSDFLTIGGFDMEVKGWGGEDVHLYRKYLHGDLIVIRTPVPGLF 120 

Query: 474 HLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

HLWHEKRC DELTPEQY+MC+QSKAMNEASH LGMLVFR EIE HL KQ +T+S+ 
Sbjct: 121 HLWHEKRCADELTPEQYRMCIQSKAMNEASHSHLGMLVFREEIETHLHKQAYRTNSE 177 

J >9.i I 343 653 10 | err;b | CAE4 5 9 3 2 . 1 [ hypothetical protein [Homo sapiens] 

Length'^'" i" 3 3" 

Score = 314 bits (805), Expect = 3e-84 

Identities = 163/303 (53%), Positives = 216/303 (71%), Gaps = 11/303 (3%) 

Query: 29 LYMLACTPKGDEEQLALPRT^SPTGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQE 88 

+Y+L C P+ D GKE YQA+LQE EE ++ +SLKRQIAQLK+ELQE 

Sbjct: 29 MYLLECAPQTDGNASLPGWGENYGKEYYQALLQEQEEHYQTRATSLKRQIAQLKQELQE 88 

Query: 89 RSEQLRNGQYQAS-DAAGLGLDRSPPEKTQADLLAFLHSQVDKAEVNAGVKLATEYAAVP 147 

SE++R+ Q + + A G+G +S E+ +DLL FLHSQ+DKAEV+ G KL +EY +P 
Sbjct: 89 MSEKMRSLQERRNVGANGIGY-QSNKEQAPSDLLEFLHSQIDKAEVSIGAKLPSEYGVIP 147 

Query: 148 FDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPAENSPNHRP 203 

F+SFTL KV+QLE GLTRHPEEKPVRKDKRDELVE IE+ LE +NNP E+ 
Sbjct: 148 FESFTLMKVFQLEMGLTRHPEEKPVRKDKRDELVEVIEAGLEVINNPDEDDEQEDEEGPL 207 

Query: 204 YTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNM 258 

+ +DF+EG YRTERDKGT YEL FK E++ + LFRPF P+MKVK+E +++ 

Sbjct: 208 GEKLIFNENDFVEGYYRTERDKGTQYELFFKKADLTEYRHVTLFRPFGPLMKVKSEMIDI 267 

Query: 259 ANTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSK 318 

++IN+IVPLA+R + F QFMQNFR++CI QD ++HLTVVYFGKE +++VK ILE+ ++ 
Sbjct: 268 TRSIINIIVPLAERTEAFVQFMQNFRDVCIHQDKKIHLTWYFGKEGLSKVKSILESVTR 327 

Query: 319 AAN 321 
A+ 

Sbjct: 328 LAS 330 



U > gi I 37805317 | gb | AAH60169 . 1 1 Galnact2 protein [Mus musculus] 

Length =293 

Score = 264 bits (675), Expect = 4e-69 

Identities = 144/268 (53%), Positives = 182/268 (67%), Gaps = 15/268 (5%) 

LYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWEEQHRNYVSSLKRQIAQLKEELQE 88 
+Y+L C P+ D GKE YQA+LQE EE ++ +SLKRQIAQLK+ELQ+ 



SE++R Q + A G+G P + QA DLL FLHSQ+D+AEV+ G KL +EY 



VPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPAENSPNHRP- - 2 03 
VPF+SFTL KV+QLE GLTRHPEEKPVRKDKRDELVE IE+ +E +NNP E+ 



-YTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKL 256 
+ +DFIEG YRTERDKGT YEL FK E++ + LFRPF P+MKVKNE + 



Query: 


29 


Sbjct: 


29 


Query : 


89 


Sbjct: 


89 


Query : 


146 


Sbjct: 


146 


Query : 


204 


Sbjct: 


206 


Query : 


257 


Sbjct: 


266 



++ ++IN+IVPLA+R + F QFMQNFR 
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J > gi I 47201472 | emb | CAF87906 ♦ 1 1 unnamed protein product [Tetraodon nigroviridis] 
Length =245 

Score = 221 bits (563), Expect = 3e-56 

Identities = 127/309 (41%), Positives = 167/309 (54%), Gaps = 71/309 (22%) 

DELVEAIESALETLNNP AENSPNHR-PYTASDFIEGIYRTERDKGTLYELTFKG 230 

DELVE +E+AL+ +NNP E+ P R YT F EG+YRTERDKGTLYEL F 



F+ + LFRPF P+MKV++ + + +IN-I-IVPL+ RV+ F QF+ NFRE+CI 



D RVHLTWYFG+E + EVK LE S++ + ++ A FW+ 



F + + S+FLN + + S +N +A+PPL 
■ -FGFGMTCQYRSDFLNIGKRSRG LSLFNAEFTENGCEAIPPLA 202 



Query: 


178 


Sbjct : 


1 


Query : 


231 


Sbjct: 


61 


Query: 


291 


Sbjct: 


121 


Query: 


351 


Sbjct: 


161 


Query: 


411 


Sbjct: 


203 


Query : 


471 


Sbjct: 


237 



GGFDL++KGWG EDVHLYRKYL S+LIV+RTPV 
-GGFDLEVKGWGVEDVHLYRKYLRSDLIVIRTPVS 23 6 



LFHLWHEK 



J > gi I 37573 674 | db j | BAC98832 . 1 1 chondroitin sulfate synthase [Homo sapiens] 

Length = 882 

Score = 207 bits (527) , Expect = 5e-52 

Identities = 113/275 (41%), Positives = 160/275 (58%), Gaps - 21/275 (7%) 

INVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILEN 315 

++++VPL RDF +FM+NF MC+ V L F ++ I +KG 



KA T I + GEFSRG GL++ A+F ++ LL FCDVD+ F +FL CR N 



T G++V+YP++FSQY+P + G + PP + + K+TGFWRD+G+G+TC Y+SD + 



GGFD I+GWG EDV LY K + S L R+ G+ H++H C L P+QYKMC+ 



SKA AS QL L +E HL + +T S 

5SKASTFASTMQLAEL WLEKHLGVRYNRTLS 882 

J > qi I 4265727S | ref | XF 373440 . 2 | chondroitin sulfate synthase 3 [Homo sapiens] 

Length = 882 

Score = 207 bits (527) , Expect = 5e-52 

Identities = 113/275 (41%), Positives = 160/275 (58%), Gaps = 21/275 (7%) 

Query: 263 INVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILEN 315 

++++VPL RDF +FM+NF MC+ V L ++ F ++ I +KG 



Query : 


263 


Sbjct: 


621 


Query: 


316 


Sb j ct : 


681 


Query : 


375 


Sbjct: 


735 


Query : 


435 


Sbjct: 


792 


Query: 


495 


Sbjct: 


852 
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Sbjct: 621 VHILVPLIGRYDIFLRFMENFENMCLIPKQNVKLVIILFSRDSGQDSSKHIELIKGYQNK 680 

Query: 316 TSKAANFRNFTFIQLNGEFSRGKGLDVG-ARFWKGSNVLLFFCDVDIYFTSEFLNTCRLN 3 74 

KA T I + GEFSRG GL++ A+F ++ LL FCDVD+ F +FL CR N 

Sbjct: 681 YPKA EMTLIPMKGEFSRGLGLEMASAQF- -DNDTLLLFCDVDLIFREDFLQRCRDN 734 

Query: 375 TQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFI 434 

T G++V+YP++FSQY+P + G + PP + + K+TGFWRD+G+G+TC Y+SD + 
Sbjct: 735 TIQGQQVYYPIIFSQYDPKVTNGGN PPTDDYFIFSKKTGFWRDYGYGITCIYKSDLL 791 

Query: 43 5 NIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCM 4 94 

GGFD I-fGWG EDV LY K + S L R+ G+ H++H C L P+QYKMC+ 
Sbjct: 792 GAGGFDTSIQGWGLEDVDLYNKVILSGLRPFRSQEVGWHIFHPVHCDPNLDPKQYKMCL 851 

Query: 495 QSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSS 529 

SKA AS QL L +E HL + +T S 

Sbjct: 852 GSKASTFASTMQLAEL WLEKHLGVRYNRTLS 882 



^BA}S}.^A?/k^J:l§J!PA9f^^ chodroitin synthase 2 [Homo sapiens] 

Length = 882 

Score = 207 bits (526) , Expect = 6e-52 

Identities = 113/275 (41%) , Positives = 160/275 (58%) , Gaps = 21/275 (7%) 

Query: 263 INVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILEN 315 

++++VPL RDF +FM+NF MC+ V L ++ F ++ I +KG 

Sbjct: 621 VHILVPLIGRYDIFLRFMENFENMCLIPKQNVKLVIILFSRDSGQDSSKHIELIKGYQNK 680 

Query: 316 TSKAANFRNFTFIQLNGEFSRGKGLDVG-ARFWKGSNVLLFFCDVDIYFTSEFLNTCRLN 374 

KA T I + GEFSRG GL++ A+F ++ LL FCDVD+ F +FL CR N 

Sbjct: 681 YPKA EMTLIPMKGEFSRGLGLEMASAQF- -DNDTLLLFCDVDLIFREDFLQRCRDN 734 

Query: 375 TQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFI 434 

T G++V+YP++FSQY+P + G + PP -f + K+TGFWRD+G+G+TC Y+SD + 
Sbjct: 735 TIQGQQVYYPIIFSQYDPKVTNGGN- --PPTDDYFIFSKKTGFWRDYGYGITCIYKSDLL 791 

Query: 435 NIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCM 494 

GGFD I+GWG EDV LY K + S L R+ G+ H++H C L P+QYKMC+ 
Sbjct: 792 GAGGFDTSIQGWGLEDVDLYNKVILSGLRPFRSQEVGWHIFHPVHCDPNLDPKQYKMCL 851 

Query: 495 QSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSS 529 

SKA AS QL L +E HL + +T S 

Sbjct: 852 GSKANTFASTMQLAEL WLEKHLGVRYNRTLS 882 



U > gi I 27681143 I ref |XP 22S912 . 1 1 similar to carbohydrate (chondroitin) synthase 2 

norvegicus] 
Length =561 

Score = 206 bits (524) , Expect = le-51 

Identities = 112/271 (41%), Positives = 160/271 (59%), Gaps = 13/271 (4%) 

Query: 263 INVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYF GKEEINEVKGILENTSK 318 

++++VPL RDF +FM+NF CI V L ++ F G+E I ++ I E ++ 

Sbjct: 300 VHILVPLVGRYDIFLRFMENFESTCIIPKQNVKLVIILFSRDSGQESIKHIELIQEYQNR 359 

Query: 319 AANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPG 378 

+ T I + GEFSRG GL++G+ + ++ LL FCDVD+ F +FL CR NT G 
Sbjct: 360 YPS-AEMTLIPMKGEFSRGLGLEMGSSQFD-NDTLLLFCDVDLIFRGDFLQRCRDNTVQG 417 

Query: 379 KKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGG 438 

++V+YP++FSQY+P + + P E V KETGFWRD+G+G+TC Y+SD + GG 
Sbjct: 418 QQVYYPIIFSQYDPKVTRMGNS PTEGDFVFSKETGFWRDYGYGITCIYKSDLLGAGG 474 

Query: 439 FDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKA 4 98 
FD I+GWG EDV LY K + S L R+ G+ H++H C L P+QYKMC+ SKA 
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Sbjct: 475 FDTSIQGWGLEDVDLYNKVIQSGLRPFRSQEVGWHIFHPVHCDPNLDPKQYKMCLGSKA 534 

Query: 499 MNEASHGQLGMLVFRHEIEAHLRKQKQKTSS 529 

AS +L L +E HL + +T S 

Sbjct: 535 STFASTMKLAEL WLEKHLGVRDNRTLS 561 

Li > gi I 23411190 1 exrh \ CA.D43233 . 1 1 chondroitin synthase 2 [Homo sapiens] 

Length = 573 

Score = 206 bits (523), Expect = le-51 

Identities = 113/275 (41%), Positives = 160/275 (58%), Gaps = 21/275 (7%) 

Query: 263 INVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILEN 315 

++++VPL RDF +FM+NF MC+ V L ++ F ++ I +KG 

Sbjct: 312 VHILVPLIGRYDIFLRFMENFENMCLIPKQNVKLVIILFSRDSGQDSSKHIELIKGYQNK 371 

Query: 316 TSKAANFRNFTFIQLNGEFSRGKGLDVG-ARFWKGSNVLLFFCDVDIYFTSEFLNTCRLN 374 

KA T I + GEFSRG GL++ A+F ++ LL FCDVD+ F +FL CR N 

Sbjct: 372 YPKA EMTLIPMKGEFSRGLGLEMASAQF- -DNDTLLLFCDVDLIFREDFLQRCRDN 425 

Query: 375 TQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFI 434 

T G+-fV+YP++FSQY+P + G + PP + + K+TGFWRD+G+G+TC Y+SD + 
Sbjct: 426 TIQGQQVYYPIIFSQYDPKVTNGGN---PPTDDYFIFSKKTGFWRDYGYGITCIYPCSDLL 482 

Query: 435 NIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCM 494 

GGFD I+GWG EDV LY K + S L R+ G+ H++H C L P+QYKMC+ 
Sbjct: 483 GAGGFDTSIQGWGLEDVDLYNKVILSGLRPFRSQEVGWHIFHPVHCDPNLDPKQYKMCL 542 

Query: 495 QSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSS 529 

SKA AS QL L +E HL + +T S 

Sbjct: 543 GSKANTFASTMQLAEL WLEKHLGVRYNRTLS 573 

□ > qi 17959793 | qb | AAF71068 . 1 1 PRO0082 [Homo sapiens] 

Length = 109 

Score = 195 bits (496), Expect = 2e-48 

Identities = 84/106 (79%), Positives = 95/106 (89%) 

Query: 425 MTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDE 484 

MTCQYRSDF+ IGGFD+++KGWGGEDVHLYRKYLH +LIV+RTPV GLFHLWHEKRC DE 
Sbjct: 1 MTCQYRSDFLTIGGFDMEVKGWGGEDVHLYRKYLHGDLIVIRTPVPGLFHLWHEKRCADE 60 

Query: 485 LTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

LTPEQY+MC+QSKAMNEASH LGMLVFR EXE HL KQ +T+S+ 
Sbjct: 61 LTPEQYRMCIQSKAMNEASHSHLGMLVFREEIETHLHKQAYRTNSE 106 

O > gi [38086850 I ref I XP 194358. 2| similar to mKIAA0990 protein [Mus musculus] 

Length = 800 

Score = 192 bits (488), Expect = 2e-47 

Identities = 107/299 (35%), Positives = 168/299 (56%), Gaps = 21/299 (7%) 

Query: 215 RTERDKGTLYELTFKGDHKHEFKRLILFR-PFSPIMKVKNEKLNMANTLINVIVPLAKRV 273 

R +D G+L L+ + K+L+ F+ P S K E IN+++PL+ R 

Sbjct: 495 RINQDSGSLSFLS NSLKKLVAFQLPGS KTEHKEPKEKKINILI PLSGRF 543 

Query: 274 DKFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILENTSKAANFRNFTFIQ 329 

D F +FM NF + C+ + V L ++ F + + +V+ + + K + + 

Sbjct: 544 DMFVRFMGNFEKTCLIPNLNVKLVILLFNSDSNPDKAKQVELMRDYRVKYPK-ADMQVLP 602 

Query: 33 0 LNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQ 389 

++G FSR L+VG+ + + LLFFCDVD+ FT+EFL CR NT G+++++P++FSQ 
Sbjct: 603 VSGGFSRALALEVGSSQFNNES-LLFFCDVDLVFTAEFLQRCRANTVLGQQIYFPIIFSQ 661 

Query: 390 YNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGE 449 



Z0of28 



6/23/04 1:42 PM 



RK)=1088011356-16594-155894407553.BLASTQ4, 



http://www.ncbi. nlmnih. go v/BLAST/Blast.cgi 



Y+P I+Y P + ++TGFWR++GFG+TC Y+ D + +GGFD+ I+GWG E 

Sbjct: 662 YDPKIVY SGKVPSDNHFAFTQKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLE 718 

Query: 450 DVHLYRKYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLG 508 

DV L+ K + + L R+ G+ H+ H C L P+QYKMC+ SKA S QL 
Sbjct: 719 DVDLFNKWQAGLKTFRSQEVGWHIHHPWCDPNLDPKQYKMCLGSKASTFGSTQQLA 777 



□ > gi I 37360174 | dbj \ RAC98065 . 1 1 mKIAA0990 protein [Mus musculus] 

Length =821 

Score = 192 bits (488) , Expect = 2e-47 

Identities = 107/299 (35%), Positives = 168/299 (56%), Gaps = 21/299 (7%) 

Query: 215 RTERDKGTLYELTFKGDHKHEFKRLILFR-PFSPIMKVKNEKLlSnviANTLIlWIVPLAKR 273 

R +D G+L L+ + K+L+ F+ P S K E IN+++PL+ R 

Sbjct: 516 RINQDSGSLSFLS NSLKKLVAFQLPGS KTEHKEPKEKKINILIPLSGRF 564 

Query: 274 DKFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILENTSKAANFRNFTFIQ 329 

D F +FM NF + C+ + V L ++ F + + +V+ + -f K + + 

Sbjct: 565 DMFVRFMGNFEKTCLIPNLNVKLVILLFNSDSNPDKAKQVELMRDYRVKYPK-ADMQVLP 623 

Query: 33 0 LNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQ 389 

++G FSR L+VG+ + + LLFFCDVD+ FT+EFL CR NT G+++++P++FSQ 
Sbjct: 624 VSGGFSRALALEVGSSQFNNES-LLFFCDVDLVFTAEFLQRCRANTVLGQQIYFPIIFSQ 682 

Query: 390 YNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGE 449 

Y+P I+Y P + ++TGFWR++GFG+TC Y+ D + +GGFD+ I+GWG E 

Sbjct: 683 YDPKIVY SGKVPSDNHFAFTQKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLE 739 

Query: 450 DVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLG 508 

DV L+ K + + L R+ G+ H+ H C L P+QYKMC+ SKA S QL 
Sbjct: 740 DVDLFNKWQAGLKTFRSQEVGWHIHHPWCDPNLDPKQYKMCLGSKASTFGSTQQLA 798 

O > gi 1 18606331 1 gb | AAH23112 . 1 1 Galnact2 protein [Mus musculus] 

Length =109 

Score = 191 bits (486), Expect = 3e-47 

Identities = 81/106 (76%), Positives = 96/106 (90%) 

Query: 425 MTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDE 484 

MTCQY+SDF+++GGFD+++KGWGGEDVHLYRKYLH +LIV+RTPV GLFHLWHEK C DE 
Sbjct: 1 MTCQYQSDFLSVGGFDMEVKGWGGEDVHLYRKYLHGDLIVIRTPVPGLFHLWHEKHCADE 60 

Query: 485 LTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

LTPEQY+MC+QSKAMNEASH LGM+VFR EIE HLRKQ +T+S+ 
Sbjct: 61 LTPEQYRMCIQSKAMNEASHSHLGMMVFREEIEMHLRKQAYRTNSE 106 

iJ > gi N7086685 I re£ I MP 997843 -ij carbohydrate (chondroitin) synthase 1; wu:fc27h0 

g i i 40 3 527 10 jgbiAAH64670. 1 j Carbohydrate (chondroitin) synthase 1 [Danio rerio] 

Length" =801 

Score = 187 bits (475) , Expect - 5e-46 

Identities = 101/281 (35%), Positives = 159/281 (56%), Gaps = 13/281 (4%) 

Query: 234 HEFKRLILFRPFSPIMKVKNEKLNMANTLIOTIVPIAKRVDKFRQFMQNFREMCIEQDGR 293 

+ K L+ F+ SP ++ K IN++VPLA R + F +FM NF ++C+ + 

Sbjct: 507 NSLKMLVPFKLSSPGIEQHEPK EKKINILVPLAGRYEIFLRFMANFEKICLIPNQN 562 

Query: 294 VHLTWYFGK EEINEVKGILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKG 349 

V L ++ F EI +++ + E K + ++G FSR L+VG+ + 

Sbjct: 563 VKLLILLFSTDNNTERIKQIELMREYRMKYPK-ADMEIKPVSGPFSRALALEVGSAHFT- 620 

Query: 350 SNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQL 409 
++ LLF+CDVD+ FT +FL CR NT G++ ++P++FSQY+P ++Y P + 
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Sbjct: 621 NDSLLFYCDVDLLFTPDFLTRCRGNTILGEQTYFPIIFSQYDPKVVYA---GKVPSDNHY 677 

Query: 410 VIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPV 469 

V +TG WR +GFG+ C Y+ D + GGFD+ I+GWG EDV L+ K++ S + + R+ 
Sbjct: 678 VFTSKTGLWRHYGFGIVCVYKGDLVKAGGFDVSIQGWGLEDVDLFNKFVQSGIKLFRSTD 737 

Query: 470 RGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGML 510 

G+ H+ H C L P+QYKMC+ SKA + S QL L 
Sbjct: 738 TGIVHVHHPWCDPNLDPKQYKMCLGSKASSHGSTQQLAEL 778 

LI >gij 47213615 ) ert^b | CA.F95956 . lJ unnamed protein product [Tetraodon nigroviridis] 

Length = 838 

Score = 186 bits (473), Expect = 9e-46 

Identities = 96/252 (38%), Positives = 143/252 (56%), Gaps = 9/252 (3%) 

Query: 263 INVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGK EEINEVKGILENTSK 318 

+N++VPL+ RDF +FM NF +C+ + V L V+ F E + +V+ + E K 

Sbjct: 570 VNILVPLSGRYDIFVRFMANFERICLIPNQNVKLLVLLFNTDNNTERVKQVELMREYHMK 629 

Query: 319 AANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPG 378 

+ G FSR L+VG+ + ++ LLF+CDVD+ FTSEFL CR NT G 
Sbjct: 63 0 YPR-AEMEIRPVTGSFSRALALEVGSLHFS-NDSLLFYCDVDLLFTSEFLKRCRANTALG 687 

Query: 379 KKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGG 438 

+ ++P++FSQY+P ++Y P V +TG WR++GFG+ C Y+ D + GG 

Sbjct: 688 AQAYFPIIFSQYPPKVWA---GKVPSimiYVFTAKTGLWRNYGFGIVCVYKGDLVRAGG 744 

Query: 439 FDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKA 498 

FD I+GWG EDV L+ K++ S + + R+ G+ H+ H C L +QYKMC+ SKA 
Sbjct: 745 FDTSIQGWGLEDVDLFNKFVQSGIKLFRSTDTGIVHIHHPVICDPNLEAKQYKMCLGSKA 804 

Query: 4 99 MNEASHGQLGML 510 

+ S QL L 
Sbjct: 805 SSHGSTQQLAEL 816 



[J > gi I 40789012 | dhj \ aAA76834 . 2 | KIAA0990 protein [Homo sapiens] 

Length = 883 

Score =: 186 bits (473), Expect = 9e-46 

Identities = 106/297 (35%), Positives = 169/297 (56%), Gaps = 17/297 (5%) 

Query: 215 RTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMANTLINVIVPLAKRVD 274 

R ++ G+L L+ + K+L+ F+ + K+E + IN+++PL+ R D 

Sbjct: 577 RINQESGSLSFLS NSLKKLVPFQ LPGSKSEHKEPKDKKINILIPLSGRFD 626 

Query: 275 KFRQFMQNFREMCIEQDGRVHLTVVYFGKEE-INEVKGILENTSKAANF--RNFTFIQLN 331 

F +FM NF + C+ + V L V+ F + ++ K + T + + + ++ 

Sbjct: 627 MFVRFMGNFEKTCLIPNQNVKLWLLFNSDSNPDKAKQVELMTDYRIKYPKADMQILPVS 686 

Query: 332 GEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYN 391 

GEFSR L+VG+ + + LLFFCDVD+ FT+EFL CR NT G+++++P++FSQY+ 
Sbjct: 687 GEFSRALALEVGSSQFNNES-LLFFCDVDLVFTTEFLQRCRANTVLGQQIYFPIIFSQYD 745 

Query: 3 92 PGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDV 451 

P I+Y P + ++TGFWR++GFG+TC Y+ D + +GGFD+ I+GWG EDV 

Sbjct: 746 PKIVY SGKVPSDNHFAFTQKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLEDV 802 

Query: 452 HLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLG 508 

L+ K + + L R+ G+ H+ H C L P+QYKMC+ SKA S QL 
Sbjct: 803 DLFNKWQAGLKTFRSQEVGWHVHHPVFCDPNLDPKQYKMCLGSKASTYGSTQQLA 859 

tJ > gi 1 15617453 | dbj | BA364 936 . 1 1 chondroitin synthase [Homo sapiens] 

Length =802 
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Score = 186 bits (472), Expect = le-45 

Identities = 106/297 (35%), Positives = 169/297 (56%), Gaps = 17/297 (5%) 

Query: 215 RTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKI.NMANTLINVIVPLAKRVD 274 

R ++ G+L L+ + K+L+ F+ + K+E + IN+++PL+ R D 

Sbjct: 496 RINQESGSLSFLS NSLKKLVPFQ LPGSKSEHKEPKDKKINILIPLSGRFD 545 

Query: 275 KFRQFMQNFREMCIEQDGRVHLTWYFGKEE-INEVKGILENTSKAANF'-RNFTFIQLN 331 

F +FM NF + C+ + V L V+ F + ++ K + T + + + ++ 

Sbjct: 546 MFVRFMGNFEKTCLIPNQWKLVVLLFNSDSNPDKAKQVELMTDYRIKYPKADMQILPVS 605 

Query: 332 GEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQYN 391 

GEFSR L+VG+ + + LLFFCDVD+ FT+EFL CR NT G+++++P++FSQY+ 
Sbjct: 606 GEFSRALALEVGSSQFNNES-LLFFCDVDLVFTTEFLQRCRANTVLGQQIYFPIIFSQYD 664 

Query: 392 PGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDV 4 51 

P I+Y P + ++TGFWR++GFG+TC Y+ D + +GGFD+ I+GWG EDV 

Sbjct: 665 PKIVY---SGKVPSDNHFAFTQKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLEDV 721 

Query: 452 HLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLG 508 

L+ K + + L R+ G+ H+ H C L P+QYKMC+ SKA S QL 
Sbjct: 722 DLFNKWQAGLKTFRSQEVGWHVHHPVFCDPNLDPKQYKMCLGSKASTYGSTQQLA 778 

LJ >gi I 31542309 | ref [ NF _0 5S7 33 . 2 | carbohydrate (chondroitin) synthase 1; chondroit 

sapiens] 

gi j 28279346 i gb j AAH4 62 47 . 1 \ Carbohydrate (chondroitin) synthase 1 [Homo sapiens] 

Length =""802 

Score = 185 bits (470), Expect = 2e-45 

Identities = 106/298 (35%), Positives = 170/298 (57%), Gaps = 19/298 (6%) 

Query: 215 RTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMANTLIWIVPIAKRVD 274 

R ++ G+L L+ + K+L+ F+ + K+E + IN+++PL+ R D 

Sbjct: 496 RINQESGSLSFLS NSLKKLVPFQ LPGSKSEHKEPKDKKINILIPLSGRFD 545 

Query: 275 KFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILENTSKAANFRNFTFIQL 330 

F +FM NF + C+ + V L V+ F + + -fV+ + + K + + + 

Sbjct: 546 MFVRFMGNFEKTCLIPNQNVKLWLLFNSDSNPDKAKQVELMRDYRIKYPK-ADMQILPV 604 

Query: 331 NGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 3 90 

+GEFSR L+VG+ + + LLFFCDVD+ FT+EFL CR NT G+++++P++FSQY 
Sbjct: 605 SGEFSRALALEVGSSQFNNES-LLFFCDVDLVFTTEFLQRCRANTVLGQQIYFPIIFSQY 663 

Query: 3 91 NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGED 450 

+P I+Y P + ++TGFWR++GFG+TC Y+ D + +GGFD+ I+GWG ED 

Sbjct: 664 DPKIVY SGKVPSDNHFAFTQKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLED 720 

Query: 451 VHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLG 508 

V L+ K + + L R+ G+ H+ H C L P+QYKMC+ SKA S QL 
Sbjct: 721 VDLFNKWQAGLKTFRSQEVGWHVHHPVFCDPNLDPKQYKMCLGSKASTYGSTQQLA 778 



□ >gi I 3 7132131 |gb I AAQ83393 , 1 1 CHSYl [Homo sapiens] 

Length = 802 " 

Score = 185 bits (470) , Expect = 2e-45 

Identities = 106/298 (35%), Positives = 170/298 (57%), Gaps = 19/298 (6%) 

Query: 215 RTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMANTLINVIVPLAKRVD 274 
R ++ G+L L+ + K+L+ F+ + K+E + IN+++PL+ R D 

Sbjct: 496 RINQESGSLSFLS NSLKKLVPFQ LPGSKSEHKEPKDKKINILIPLSGRFD 545 

Query: 275 KFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILENTSKAANFRNFTFIQL 330 

F +FM NF + C+ + V L V+ F + + +V+ + + K + + + 

Sbjct: 546 MFVRFMGNFEKTCLIPNQNVKLWLLFNSDSNPDKAKQVELMRDYRIKYPK-ADMQILPV 604 
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Query: 331 NGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 3 90 

+GEFSR L+VG-f + + LLFFCDVD+ FT+EFL CR NT G+++++P++FSQY 
Sbjct: 605 SGEFSRALALEVGSSQFNNES-LLFFCDVDLVFTTEFLQRCRANTVLGQQIYFPIIFSQY 663 

Query: 391 NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGED 450 

+P I+Y P + ++TGFWR++GFG+TC Y+ D + +GGFD+ I+GWG ED 

Sbjct: 664 DPKIVY SGKVPSDNHFAFTQKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLED 720 

Query: 451 VHLYRKYLHSl^IVWTPWGLFHLWHEKRCMDELTPEQYKMCMQSKAl^EASHGQM 508 

V L+ K + + L R+ G+ H+ H C L P+QYKMC+ SKA S QL 
Sbjct: 721 VDLFNKWQAGLKTFRSQEVGWHVHHPVFCDPNLDPKQYKMCLGSKASTYGSTQQLA 778 



>aiixi^i^.7i^5li£^:jLl^£^ similar to carbohydrate (chondroitin) synthase 1 

norvegicus] 
Length = 799 

Score = 183 bits (465) , Expect - 7e-45 

Identities = 106/298 (35%), Positives = 168/298 (56%), Gaps = 19/298 (6%) 

Query: 215 RTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMANTLINVIVPIAK^ 274 

R +D G+L L+ + K+L+ F+ M+ K K IN+++PL+ R D 

Sbjct: 495 RINQDSGSLSFLS NSLKKLVPFQLPGSKMEHKEPK EKKINILIPLSGRFD 544 

Query: 275 KFRQFMQNFREMCIEQDGRVHLTWYFGKE EINEVKGILENTSKAANFRNFTFIQL 330 

F +FM NF + C+ + V L ++ F + + +V+ + + K + + + 

Sbjct: 545 MFVRFMGNFEKTCLIPNLNVKLVILLFNSDSNPDKAKQVELMRDYRVKYPK-ADMQVLPV 603 

Query: 331 NGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 3 90 

+G FSR L+VG+ + + LLFFCDVD+ FT+EFL CR NT G+++++P++FSQY 
Sbjct: 604 SGGFSRALALEVGSSQFSNES-LLFFCDVDLVFTAEFLQRCRANTVLGQQIYFPIIFSQY 662 

Query: 391 NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGED 450 

+P I+Y P + ++TGFWR++GFG+TC Y+ D + +GGFD+ I+GWG ED 

Sbjct: 663 DPKIVY- - -SGKVPSDNHFAFTQKTGFWRNYGFGITCIYKGDLVRVGGFDVSIQGWGLED 719 

Query: 451 VHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLG 508 

V L+ K + + L R+ G+ H+ H C L P+QYKMC+ SKA S QL 
Sbjct: 720 VDLFNKWQAGLKTFRSQEVGWHVHHPVFCDPNLDPKQYKMCLGSKASTFGSTQQLA 777 

L; > gi I 41210270 | ref | XP 210054 . 3 | similar to dJ341D10,l (novel protein) [Homo sapi 

gi i 17736645 j \ C:;ad19074 . 1 \ dJ341D10.1 (novel protein) [Homo sapiens] 
Length = 109 

Score = 174 bits (442) , Expect - 3e-42 

Identities = 77/106 (72%), Positives = 90/106 (84%) 

Query: 425 MTCQYRSDFINIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDE 484 

MTCQYRSDF+ IG FD+++KGWGGEDVHL +KYLH +LIV+ TPV GLFHLW EKRC DE 
Sbjct: 1 MTCQYRSDFLIIGRFDMEVKGWGGEDVHLCQKYLHGDLIVIWTPVTGLFHLWPEKRCADE 60 

Query: 485 LTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSK 53 0 

LTPEQY+MC+QSKA+NEAS LGMLVFR EXE HL KQ +T+S+ 
Sbjct: 61 LTPEQYRMCIQSKAINEASRSHLGMLVFREEIETHLHKQAYRTNSE 106 



iU > gi I 48139301 1 ref I XP 396991. l| similar to chondroitin synthase- like protein [Ap 

Length = 1094 

Score = 174 bits (442) , Expect = 3e-42 

Identities = 87/258 (33%), Positives = 146/258 (56%), Gaps = 8/258 (3%) 

Query: 260 NTLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYF- - -GKEEINEVKGILENT 316 

+ +++ I+PL+ R + FR+F+QN+ E+C+ R L ++ + + N ++E 
Sbjct: 549 DKMVHFILPLSGRYEIFRRFLQNYEEICLTSGERTALLIMLYHHRTENSFNRTIDLVERL 608 

Query: 317 SKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQ 376 
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+ I ++G FSR K L+ G K +N L+ F DVDI FT L R+NT 
Sbjct: 609 KYKYRSASIDIIPISGTFSRAKALNYGVSRLK-NNDLMLFIDVDIAFTESALYRIRVNTL 667 

Query: 377 PGKKVFYPVLFSQYNPGI lYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINI 436 

G+++-f -fPV+FSQY+P I+YG DP I + G+WR FGFG+ Y+ D+ ++ 

Sbjct: 668 LGRQMYFPWFSQYDPKIVYG--DTKSP--DTFAINEMAGYWRQFGFGIVSLYKQDYKHV 723 

Query: 437 GGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 496 

GGF+L I+GWG EDV Y + + S++ + R +L H++H+ C +L+ Q+ MCM + 
Sbjct: 724 GGFNLSIQGWGKEDVDFYERVIKSSIKIFRAADKDLVHVYHDVECSKDLSETQWSMCMGT 783 

Query: 497 KAMNEASHGQLGMLVFRH 514 

KA A L +++ + 
Sbjct: 784 KADTLAGTETLAQMIYEN 801 



IJ >9i I 4 2662609 I r ef | XP_37ei28 . 1 1 similar to dJ19Nl.l (novel protein) [Homo sapien 

gi I 986J54l]"ernb"j (:!AC0414'ir'lT "" " ""dJ19Nl.l (novel protein) [Homo sapiens] 
Length = 93 

Score = 149 bits (375) , Expect = 2e-34 
Identities = 65/90 (72%), Positives = 76/90 (84%) 

Query: 441 LDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMN 500 

+++KGWGGEDVHLYRKYL +LIV+R PV GLFHLWHEK C DELTPEQY+MC+QSKAM+ 
Sbjct: 1 MEVKGWGGEDVHLYRKYLRGDLIVII^PVPGLFHLWHEKHCADELTPEQYRiyiCIQSKAMD 60 

Query: 501 EASHGQLGMLVFRHEIEAHLRKQKQKTSSK 530 

EAS LGMLVFR EIE HL KQ +T+S+ 
Sbjct: 61 EASRSHLGMLVFREEIETHLHKQAYRTNSE 90 



U > gi I 31158517 I gb | AAG8S275 . 1 1 
Length = 827 



chondroitin synthase-like protein [Drosophila melanc 



Score = 136 bits (343), E^qject = le-30 

Identities = 85/267 (31%), Positives = 132/267 (49%), Gaps = 32/267 (11%) 



Query : 


263 


Sb j ct : 


522 


Query : 


322 


Sbjct: 


582 


Query : 


380 


Sbjct: 


641 


Query : 


438 


Sbjct: 


701 


Query: 


472 


Sbjct: 


761 



++P+A R+ F +F++ + +C+ 



L W FG +E+ + 



+L + 



++ +IQ + FSRG LDV AR + + +++LF DVD+ F E L R++TQ GK 



+V+ P++FSQY+P 



GFD DI GWG EDV 



I E G++R FGFG+ Y+SD + +1 



K + 



-SNLIWRTPVRG 471 
L V R P 



L H++H+ C +L QY MC+ +KA 



□ > gi|4555519l|ref |NP 996440. l| CG9220-PC [Drosophila melanogaster] 



gi j 4544 6963 | gb | AAS65341 . 1 



CG9220-PC [Drosophila melanogaster] 



Length = 863 



Score = 136 bits (343), Expect = le-30 

Identities = 85/267 (31%), Positives = 132/267 (49%), Gaps = 32/267 (11%) 

Query: 263 IWIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGK-EEINEVKGILENTSKAAN 321 
I ++P+A R+ F +F++ + +C+ + L W FG +E+ + +L + 
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Sbjct: 558 IVFVLPIAGRLGTFERFLRTYERVCVRGEQHCDLLWIFGSPDELGDHLQLLHDLHARHV 617 

FRNFTFIQLNGEFSRGKGLDVGAR- -FWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGK 379 
++ +IQ + FSRG LDV AR + + +++LF DVD+ F E L R++TQ GK 
YQQVNWIQRSSAFSRGVALDVAARSSYIRQEDI ILFI -DVDMVFEVETLQRVRMHTQRGK 676 

KVFYPVLFSQYNPGI lYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFI - -NIG 43 7 
+V+ P++FSQY+P G + I E G++R FGFG+ Y+SD + +1 

QVYLPIVFSQYDPQRRSGDAGGSEDEGETPRIDDERGYFRQFGFGICAIYKSDILDEDIN 73 6 

GFDLD I KGWGGE DVHLYRKYLH SNLI WRT PVRG 471 

GFD DI GWG EDV K + L V R P 

GFDKDITGWGLEDVKFLEKIVRVGTRQRGFLANTAELAMDYNEAAEQWRRLSVFRAPDPT 796 

LFHLWHEKRCMDELTPEQYKMCMQSKA 498 
L H++H+ C +L QY MC+ +KA 



Sbj Ct : 


558 




322 


Sbj ct : 


618 


Queiry : 


380 


Sbjct : 


677 




438 


Sbjct: 


737 


Query: 


472 


Sbjct: 


797 



^ >gAl j 0 ] ^ef I NP_99643 9 . 1 [ CG9220-PB [Drosophila melanogaster] 



gi j 22832276 = gb | AAF48479 . 2 | CX39220-PB [Drosophila melanogaster] 



Length = 608 
Score = 135 bits (341), Expect = 2e-30 

Identities = 85/267 (31%), Positives = 132/267 (49%), Gaps = 32/267 (11%) 

/I VPLAKRVDKFRQFMQNFREMC I EQDGRVHLTWYFGK- EE I NEVKG I LENTS KAAN 3 2 : 
++P-fA R+ F +F++ + +C+ + L W FG +E+ + +L + 



++ +IQ + FSRG LDV AR + + +++LF DVD+ F E L R++TQ GK 



+V+ P++FSQY+P G + I E G++R FGFG+ Y+SD + +1 



GFD DI GWG EDV K + L V R P 



L H++H+ C +L QY MC+ +KA 
LVHIYHDISCDVQLDAPQYNMCLGTKA 568 

LJ >gi [31153514 I gb I AA035273 . l| chondroitin disaccharide polymerase [Caenorhabditis 
Length = "73T 

Score = 134 bits (336), Expect = 6e-30 

Identities = 82/245 (33%), Positives = 134/245 (54%), Gaps = 10/245 (4%) 

Query: 263 INVIVPLAKRVDKFRQFMQNFREMCIE--QDGRVHLTVVYFGKEEINEVKGILENTSKAA 320 

+++I+PL R F +F Q+ + +C D V LT+V + E+ E + +E A 

Sbjct: 484 LHMIMPLRGRAAIFARFAQHLKSICARGGDDLAVSLTIVLYSSEDEMENRETIEMLR--A 541 

Query: 321 NFRNFTFIQLNG-EFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGK 379 

+F T I++ FSRG L GA +N LLFF DVD+ FT + L + NT 

Sbjct: 542 SFIPVTVIEMGDVSFSRGVALMRGAETLP-ANALLFFTDVDMLFTCDALKRIKSNTILNA 600 

Query: 380 KVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGF 439 

++++P++FS+++ + +D + L + G++R FG+G+ Y++D +++GGF 

Sbjct: 601 QIYFPIVFSEFSHES-WSENDKL--LADAFHYGRGRGYFRHFGYGLAAMYKADLMDVGGF 657 

Query: 440 DLDIKGWGGEDVHLYRKYL-HSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKA 498 
D I+GWG EDV L+ K + + L V+R P GL H++H C + + Q MC SKA 



Query: 


263 


Sbjct: 


303 


Query : 


322 


Sbjct: 


363 


Query : 


380 


Sbjct: 


422 


Query: 


438 


Sbjct: 


482 


Query : 


472 


Sbjct: 


542 
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RID=1 0880 11356-1 6594-1 55894407553.BLASTQ4, 



http://www.iicbi.nlm.Dih. gov/BLAST/Blast.cgi 



Sbjct: 658 DTKIEGWGKEDVDLFEKAIKNGRLRVIRVPEPGLVHIYHPIHCDENMPTAQKDMCHGSKA 717 

Query: 499 MNEAS 503 
+ AS 

Sbjct: 718 ASLAS 722 



J > gi I 3 1044074 | dbj | BAC7678Q . 1 1 chondroitin synthase [Caenorhabditis elegans] 
Length = 726 

Score = 134 bits (336), Expect = 6e-30 

Identities = 82/245 (33%), Positives = 134/245 (54%), Gaps = 10/245 (4%) 

INVIVPLAKRVDKFRQFMQNFREMCIE- -QDGRVHLTWYFGKEEINEVKGILENTSKT^A 320 
+++I+PL R F +F Q+ + +C D V LT+V + E+ E + +E A 



+F T I++ FSRG L GA +N LLFF DVD+ FT + L + NT 



++++P++FS+++ + +D + L + G++R FG+G+ Y++D +++GGF 



I+GWG EDV L+ K + + L V+R P GL H++H C + + Q MC SKA 



+ AS 



Query: 


263 


Sbjct: 


476 


Query : 


321 


Sbjct : 


534 


Query : 


380 


Sbjct: 


593 


Query : 


440 


Sbjct: 


650 


Query : 


499 


Sbjct : 


710 



Get selected sequences ^ j Select all \ \ Deselect all 



Database: All non-redundant GenBank CDS 

translations+PDB+SwissProt+PIR+PRF excluding environmental samples 

Posted date: Jun 23, 2 004 12:06 AM 
Number of letters in database: 619,474,291 
Number of sequences in database: 1,866,121 

Lambda K H 

0,321 0.137 0,410 

Gapped 

Lambda K H 

0-267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 129,716,455 

Number of Sequences: 1866121 

Number of extensions: 5644366 

Number of successful extensions: 16471 

Number of sequences better than 10.0: 50 

Number of HSP's better than 10.0 without gapping: 25 

Number of HSP's successfully gapped in prelim test: 25 

Number of HSP's that attempted gapping in prelim test: 16391 

Number of HSP's gapped (non-prelim): 65 

length of query: 532 

length of database: 619,474,291 

effective HSP length: 13 0 

effective length of query: 402 
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RE)=1088011356-16594-155894407553.BLASTQ4, 



http://www,i3cbi.nliiinih.gov/BLAST/Blast.cgi 



effective length of database: 376,878,561 
effective search space: 151505181522 
effective search space used: 151505181522 



T: 


11 








A: 


40 








XI 


16 


( 7 


.4 


bits) 


X2 


38 


{14 


.6 


bits) 


X3 


64 


{24 


.7 


bits) 


SI 


41 


{21 


.8 


bits) 


S2 


76 


{33 


9 bits) 
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